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MOLECULES FOR DIAGNOSTICS AND THERAPEUTICS 

TECHNICAL FIELD 

The present invention relates to human molecules and to the use of these sequences in the 
5 diagnosis, study, prevention, and treatment of diseases associated with, as well as effects of exogenous 
compounds on, the expression of human molecules. 



BACKGROUND OF THE INVENTION 

The human genome is comprised of thousands of genes, many encoding gene products that 

10 function in the maintenance and growth of the various cells and tissues in the body. Aberrant 

expression or mutations in these genes and their products is the cause of, or is associated with, a variety 
of human diseases such as cancer and other cell proliferative disorders, autoimmune/inflammatory 
disorders, infections, developmental disorders, endocrine disorders, metabolic disorders, neurological 
disorders, gastrointestinal disorders, transport disorders, and connective tissue disorders. The 

1 5 identification of these genes and their products is the basis of an ever-expanding effort to find markers 
for early detection of diseases, and targets for their prevention and treatment Therefore, these genes 
and their products are useful as diagnostics and therapeutics. These genes may encode, for example, 
enzyme molecules, molecules associated with growth and development, biochemical pathway molecules, 
extracellular information transmission molecules, receptor molecules, intracellular signaling molecules, 

2 o membrane transport molecules, protein modification and maintenance molecules, nucleic acid synthesis 
and modification molecules, adhesion molecules, antigen recognition molecules, secreted and 
extracellular matrix molecules, cytoskeletal molecules, ribosomal molecules, electron transfer 
associated molecules, transcription factor molecules, chromatin molecules, cell membrane molecules, 
and organelle associated molecules. • 

25 For example, cancer represents a type of cell proliferative disorder that affects nearly every . 

tissue in the body. A wide variety of molecules, either aberrantly expressed or mutated, can be the 
cause of, or involved with, various cancers because tissue growth involves complex and ordered 
patterns of cell proliferation, cell differentiation, and apoptosis. Cell proliferation must be regulated to 
maintain both the number of cells and their spatial organization. This regulation depends upon the 

30 appropriate expression of proteins which control cell cycle progression in response to extracellular 
signals such as growth factors and other mitogens, and intracellular cues such as DNA damage or. 
nutrient starvation. Molecules which directly or indirectly modulate cell cycle progression fall into 
several categories, including growth factors and their receptors, second messenger and signal 
transduction proteins, oncogene products, tumor-suppressor proteins, and mitosis-promoting factors. 
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Aberrant expression or mutations in any of these gene products can result in cell proliferative disorders 
such as cancer. Oncogenes are genes generally derived from normal genes that, through abnormal 
expression or mutation, can effect the transformation of a normal cell to a malignant one (oncogenesis). 
Oncoproteins, encoded by oncogenes, can affect cell proliferation in a variety of ways and include 
5 growth factors, growth factor receptors, intracellular signal transducers, nuclear transcription factors, 
and cell-cycle control proteins. In contrast, tumor-suppressor genes are involved in inhibiting cell 
proliferatioa Mutations which cause reduced function or loss of function in tumor-suppressor genes 
result in aberrant cell proliferation and cancer. Although many different genes and their products have 
been found to be associated with cell proliferative disorders such as cancer, many more may exist that 
10 are yet to be discovered 

DNA-based arrays can provide a simple way to explore the expression of a single polymorphic 
' gene or a large number of genes. When the expression of a single gene is explored, DNA-based arrays 
' are employed to detect the expression of specific gene variants. For example, a p53 tumor suppressor • 
gene array is used to detennine whether individuals are carrying mutations that predispose them to • 
is, cancer. A cytochrome p450 gene array is useful to determine whether individuals have one of a number 
of specific mutations that could result in increased drug metabolism, drug resistance or drug toxicity. 

DNA-based array technology is especially relevant for the rapid screening of expression of a 
large number of genes. There is a growing awareness that gene expression is affected in a global 
fashion. A genetic predisposition, disease or therapeutic treatment may affect, directly or indirectly, the 
2 o expression of a large number of genes. In some cases the interactions may be expected, such as when 
the genes are part of the same signaling pathway. In other cases, such as when the genes participate in - 
separate signaling pathways, the interactions may be totally unexpected. Therefore, DNA-based arrays 
can be used to investigate how genetic predisposition, disease, or therapeutic treatment affects the 
expression of a large number of genes. 

25 

Enzyme Molecules 

Hie cellular processes of biogenesis and biodegradation involve a number of key enzyme 
classes including oxidoreductases, transferases, hydrolases, lyases, isomerases, and ligases. These' 
enzyme classes are each comprised of numerous substrate-specific enzymes having precise and well 
30 regulated functions. These enzymes function by facilitating metabolic processes such as glycolysis, 
the tricarboxylic cycle, and fatty acid metabolism; synthesis or degradation of amino acids, steroids, 
phospholipids, alcohols, etc.; regulation of cell signalling, proliferation, inflamation, apoptosis, etc., 
and through catalyzing critical steps in DNA replication and repair, and the process of translation. 
Oxidoreductases 

2 
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Many pathways of biogenesis and biodegradation require oxidoreductase (dehydrogenase or 
reductase) aptivity, coupled to the reduction or oxidation of a donor or acceptor cofactor. Potential 
cofactors include cytochromes, oxygen, disulfide, iron-sulfur proteins, flavin adenine dinucleotide 
(FAD), and the nicotinamide adenine dinucleotides NAD and N ADP (Newsholme, E. A. and A.R. 
5 Leech (1983) Biochemistry for the Medical Sciences, John Waty and Sons, Chichester, U.K., pp. 
779-793). Reductase activity catalyzes the transfer of electrons between substrate(s) and cofactor(s) 
with concurrent oxidation of the cof actor. The reverse dehydrogenase reaction catalyzes the reduction 
of % cofactor and consequent oxidation of the substrate. Oxidoreductase enzymes are a broad . 
superfamily of proteins that catalyze numerous reactions in all cells of organisms ranging from 

10 bacteria to. plants to humans. These reactions include metabolism of sugar, certain detoxification 
reactions in the liver, find the synthesis or degradation of fatty acids, amino acids, glucocorticoids, 
estrogens, androgens, and prostaglandins. Different family .members are named according to the 
direction in which their reactions are typically catalyzed; thus they may be referred to as . 
oxidoreductases, oxidases, reductases, or dehydrogenases. In addition, family members often have 

1 5 distinct cellular localizations, including the cytosol, the plasma membrane, mitochondrial inner or 
outer membrane, and peroxisomes. 

Short-chain alcohol dehydrogenases (SCADs) are a family of dehydrogenases that only share 
15% to 30% sequence identity, with similarity predominantly in the coenzyme binding domain and 
the substrate binding domain. In addition to the well-known role in detoxification of ethanol, SCADs 

20 are also involved in synthesis and degradation of fatty acids, steroids, and some prostaglandins, and 
are therefore implicated in a variety of disorders such as lipid storage disease, myopathy, SCAD 
deficiency, and certain genetic disorders. For example, retinol dehydrogenase is a SCAD-family 
member (Simon, A. et al. (1995) J. Biol. Chem. 270:1107-1112) that converts retinol toTfetinal, the 
precursor of.retinoic acid. Retinoic acid, a regulator of differentiation and apoptosis, has been shown 

25 to down-regulate genes involved in cell proliferation and inflammation (Chai, X. et al; (1995) J. Biol. 
Chem. 270:3900-3904). In addition, retinol dehydrogenase has been linked to hereditary eye diseases 
such as autosomal recessive childhood-onset severe retinal dystrophy <Simon, A. et al: (1996) 
Genomics 36:424-430). 

Propagation of nerve impulses, modulation of cell proliferation and differentiation, induction 

30 of the immune response, and tissue homeostasis involve neurotransmitter metabolism (Weiss, B. 
(1991) Neurotoxicology 12:379-386; Collins, S.M. et al. (1992) Ann. N.Y. Acad. Sci. 664:415-424; 
Brown, J.K. and H. Imam (1991) J. Inherit. Metab. Dis. 14:436-458). Many pathways of 
neurotransmitter metabolism require oxidoreductase activity, coupled to reduction or oxidation of a 
cofactor, such as NADVNADH (Newsholme, E.A. and A.R. Leech<1983) Biochemistry for the 
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Medical Sciences . John Wiley and Sons, Chichester, U.K. pp. 779-793). Degradation of 
catecholamines (epinephrine or norepinephrine) requires alcohol dehydrogenase (in the brain) or 
aldehyde dehydrogenase (in peripheral tissue). NAD + -dependent aldehyde dehydrogenase oxidizes 5- 
hydroxyindole-3-acetate (the product of 5-hydroxytryptamine (serotonin) metabolism) inthe brain, 
5 • blood platelets* liver and pulmonary endothelium (Newsholme. supra, p. 786). Other 

neurotransmitter degradation pathways that utilize NAD + /NADH-dependent oxidoreductase activity 
include those of L-DOPA (precursor of dopamine, a neuronal excitatory compound), glycine (an 
, inhibitory neurotransmitter in the brain and spinal, cord), histamine (liberated from mast cells during 
. the inflammatory response), and taurine (an inhibitory neurotransmitter of the brain stem, spinal cord 
1,0.. and retina) (Newsholme, supra , pp. 790. 792). Epigenetic or genetic defects in neurotransmitter 

metabolic.pathways can result in a spectrum of disease states in different tissues including Parkinson 
, \ ; : . . disease and inherited myoclonus. (McGance, KX. and S.E. Huether (1994) Pathophysiology. Mosby- 
Year Book, Inc., St Louis MO, pp. 402-404; Gundlach, A.L. (1990) FASEB J. 4:2761-2766). 

Tetrahydrofolate is a derivatized glutamate molecule that acts as a carrier, providing activated 
15 . ; one-carbon units to a wide variety of biosynthetic reactions, including synthesis of purines, 
pyrimidines, and the amino acid methionine. Tetrahydrofolate is generated by the activity of a 
holoenzyme complex called tetrahydrofolate synthase, which includes three enzyme activities: 
tetrahydrofolate dehydrogenase, tetrahydrofolate cyclohydrolase, and tetrahydrofolate synthetase. 
Thus, tetrahydrofolate dehydrogenase plays an important role in generating building blocks for 
20 nucleic and amino acids, crucial to proliferating cells. 

3-Hydroxyacyl-CoA dehydrogenase (3HACD) is involved in fatty acid metabolism. It 
catalyzes the reduction of 3-hydroxyacyl-CoA to 3-oxoacyl-CoA, with concomitant oxidation of 
NAD to NADH, in the mitochondria and peroxisomes of eukaryotic cells. In peroxisomes, 3HACD 
and enoyl-CoA hydratase form an enzyme complex called bifunctional enzyme, defects in which are 
.25 associated with peroxisomal bifunctional enzyme deficiency. This interruption in fatty acid 

metabolism produces accumulation of very-long chain fatty acids, disrupting development of the 
brain, bone, and adrenal glands. Infants born with this deficiency typically die within 6 months 
(Watkins, P. et al. (1989) J. Clin. Invest 83:771-777; Online Mendelian Inheritance in Man (OMIM)> 
#261515). The neurodegeneration that is characteristic of Alzheimer's disease involves development 
30 of extracellular plaques in certain brain regions. A major protein component of these plaques is the 
peptide amyloid-P (AP), which is one of several cleavage products of amyloid precursor protein 
(APP). 3HACD has been shown to bind the AP peptide, and is overexpressed in neurons affected in 
Alzheimer's disease. In addition, an antibody against 3HACD can block the toxic effects of AP in a 
cell culture model of Alzheimer's disease (Yan, S. et al. (1997) Nature 389:689-695; OMIM, 

4 
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I 1 

Steroids, such as estrogen, testosterone, corticosterone, and others, are generated from a 
common precursor, cholesterol, and are interconverted into one another. A wide variety of enzymes 
act upon cholesterol, including a numher of dehydrogenases. Steroid dehydrogenases, such as the 
5 hydroxysteroid dehydrogenases, are involved in hypertension, fertility, and cancer (Duax, WX. and 
i D. Ghosh (1997) Steroids 62:95-100). One such dehydrogenase is 3-oxo-5-a-steroid dehydrogenase 

(OASD), a microsomal membrane protein highly expressed in prostate and other androgen-responsive 
tissues. OASD catalyzes the conversion of testosterone into dihydrotestosterone, which is the most 
potent androgen. Dihydrotestosterone is essential for the formation of the male phenotype during 

10 embryogenesis, as well as for proper androgen-mediated growth of tissues such as the prostate and 
male genitalia, A defect in OASD that prevents the conversion of testosterone into 
dihydrotestosterone leads to a rare form of male pseudohermaphroditis, characterized by defective 
formation of the external genitalia (Andersson, S. et al. (1991) Nature 354:159-161; Labrie, F. et al. 

' (1992) Endocrinology 131 :1571-1573; OMIM #264600). Thus, OASD plays a central role in sexual 

15 differentiation and androgen physiology. - 
17p-hydroxysteroid dehydrogenase (17PHSD6) plays an important role in the regulation of 
the male reproductive hormone, dihydrotestosterone (DHTT). 17PHSD6 acts to reduce levels of 
DHTT by oxidizing a precursor of DHTT, 3a-diol, to androsterone which is readily glucuronidated * 
and removed from tissues. 17PHSD6 is active with both androgen and estrogen substrates when 

2 o expressed in embryonic kidney 293 cells. At least five other isozymes of 17 PHSD have been 

identified that catalyze oxidation and/or reduction reactions in various tissues with preferences for 
different steroid substrates (Biswas, M.G. and D.W. Russell (1997) J. Biol. Chem. 272:15959- 
15966). For example, 17pHSDl preferentially reduces estradiol and is abundant in the ovary and 
placenta. 17PHSD2 catalyzes oxidation of androgens and is present in the endometrium and placenta. 

2 5 17PHSD3 is exclusively a reductive enzyme in the testis (Geissler, W.M. et al. (i994) Nat. Genet 

7:34-39). An excess of androgens such as DHTT can contribute to certain disease states such as 
benign prostatic hyperplasia and prostate cancer. 

Oxidoreductases are components of the fatty acid metabolism pathways in mitochondria and 
peroxisomes. The main beta-oxidation pathway degrades both saturated and unsaturated fatty acids, 

3 o while the auxiliary pathway performs additional steps required for the degradation of unsaturated fatty 

acids. The auxiliary beta-oxidation enzyme 2,4-dienoyl-CoA reductase catalyzes the removal of 
even-numbered double bonds from unsaturated fatty acids prior to their entry into the main beta- 
oxidation pathway. The enzyme may also remove odd-numbered double bonds from unsaturated 
fatty acids (Koivuranta, K.T. et al. <1994) Biochem. J. 304:787-792; Smeland, T.E. et al. (1992) Proc. 

5 
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Natl. Acad Sci. USA 89:6673-6677).' 2,4-dienoyl-CoA reductase is located in both mitochondria and 
peroxisomes. Inherited deficiencies in mitochondrial and peroxisomal beta-oxidation enzymes are 
associated ufith severe diseases, some of which manifest themselves soon after birth and lead to death 
within a few years. Defects in beta-oxidation are associated with Reye's syndrome, Zellweger 
5 syndrome, neonatal adrenoleukodystrophy, infantile Refsum's disease, acyl-CoA oxidase deficiency, 
and bifunctional protein deficiency (Suzuki, Y. et al. (1994), Am. J. Hum. Genet 54:36-43; Hoefler, 
supra : Cotran, R.S. et al. (1994) Robbins Pathologic Basis of Disease. W.B. Saunders Co., 
Philadelphia PA, p.866). Peroxisomal beta-oxidation is impaired in cancerous tissue. Although 

neoplastic human breast epithelial cells have the same number of peroxisomes as do normal cells, 

i 

10 fatty acyl-CoA oxidase activity is lower than in control tissue (el Bouhtoury, F. et ai (1992) J. Pathol. 
166:27-35). Human colon carcinomas have fewer peroxisomes than normal colon tissue and have 
lower fatty-acyl-CoA,oxidase and bifunctional enzyme (including enoyl-CoA hydratase) activities 
than normal tissue (Cable, S. et al. (1992) Virchows Arch. B Cell Pathol. Incl. Mol. Pathol. 62:221- 
226). Another important oxidoreductase is isocitrate dehydrogenase, which catalyzes the conversion 

15 of isocitrate to a-ketoglutarate, a substrate of the citric acid cycle. Isocitrate dehydrogenase can be 
either NAD or NADP dependent, and is found in the cytosol, mitochondria, and peroxisomes. 
Activity of isocitrate dehydrogenase is regulated developmentally, and by hormones, 
neurotransmitters, and growth factors. 

Hydroxypyruvate reductase (HPR), a peroxisomal 2-hydroxyacid dehydrogenase in the 

2 o glycolate pathway, catalyzes the conversion of hydroxypyruvate to glycerate with the oxidation of 

both NADH and NADPH. The reverse dehydrogenase reaction reduces NAD* and NADP*. HPR 
recycles nucleotides and bases back into pathways leading to the synthesis of ATP and GTP. ATP 
and GTP are used to produce DNA and RNA and to control various aspects of signal transduction and 
energy metabolism. Inhibitors of purine nucleotide biosynthesis have long been employed as 
. 25 antiproliferative agents to treat cancer and viral diseases. HPR also regulates biochemical synthesis 
of serine and cellular serine levels available for protein synthesis. 

The mitochondrial electron transport (or respiratory) chain is a series of oxidoreductase-type 
enzyme complexes in the mitochondrial membrane that is responsible for the transport of electrons 
from NADH through a series of redox centers within these complexes to oxygen, and the coupling of 

3 o this oxidation to the synthesis of ATP (oxidative phosphorylation). ATP then provides the primary 

source of energy for driving a cell's many energy-requiring reactions. The key complexes in the 
respiratory chain are NADH:ubiquinone oxidoreductase (complex I), succinate:ubiquinone 
oxidoreductase (complex II), cytochrome q-b oxidoreductase (complex IE), cytochrome c oxidase 
(complex IV), and ATP synthase (complex V) (Alberts, B. et al. (1994) Molecular Biology of the 

6 
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Cell . Garland Publishing, Inc., New York NY, pp. 677-678). All of these complexes are located on 
the inner majrix side of the mitochondrial membrane except complex n, which is on the cytosolic 
side. Complex II transports electrons generated in the citric acid cycle to the respiratory chain. The 
electrons generated by oxidation of succinate to fumarate in the citric acid cycle are transferred 
5 through electron carriers in complex n to membrane bound ubiquinone (Q). Transcriptional 

regulation of these nuclear-encoded genes appears to be the predominant means for controlling the 
biogenesis of respiratory enzymes. Defects and altered expression of enzymes in the respiratory chain 
are associated with a variety of disease conditions. 

i 

Other dehydrogenase activities using NAD as a cof actor are also important in mitochondrial 
10 function. 3-hydroxyisobutyrate dehydrogenase (3HBD), important in valine catabolism, catalyzes the 
NAD-dependent oxidation of 3-hydroxyisobutyrate to methylmalonate semialdehyde within 
mitochondria. Elevated levels of 3-hydroxyisobutyrate have been reported in a number of disease 
states, including ketoacidosis, methylmalonic acidemia, and other disorders associated with 
deficiencies in methylmalonate semialdehyde dehydrogenase (Rougraff, P.M. et al. (1989) J. Biol. 
15 Chem. 2643899-5903). 

Another mitochondrial dehydrogenase important in amino acid metabolism is the enzyme 
isovaleryl-CoA-dehydrogenase (TVD). IVD is involved in leucine metabolism and catalyzes the 
oxidation of isovaleryl-CoA to 3-metliylcrotonyl-CoA. Human IVD is a tetrameric flavoproteih that 
is encoded in the nucleus and synthesized in the cytosol as a 45 kDa precursor with a mitochondrial 
2 o import signal sequence. A genetic deficiency, caused by a mutation in the gene encoding IVD, results 
in the condition known as isovaleric acidemia. This mutation results in inefficient mitochondrial 
import and processing of the IVD precursor (Vockley, J. et al. (1992) J. Biol. Chem. 267:2494-2501). 
Transferases 

Transferases are enzymes that catalyze the transfer of molecular groups. The reaction may 

2 5 involve an oxidation, reduction, or cleavage of covalent bonds, and is often specific to a substrate or 

to particular sites on a type of substrate. Transferases participate in reactions essential to such 
functions as synthesis and degradation of cell components, regulation of cell functions including cell 
signaling, cell proliferation, inflamation, apoptosis, secretion and excretion. Transferases are 
involved in key steps in disease processes involving these functions. Transferases are frequently 

3 o classified according to the type of group transferred. For example, methyl transferases transfer one- 

carbon methyl groups, amino transferases transfer nitrogenous amino groups, and similarly 
denominated enzymes transfer aldehyde or ketone, acyl, glycosyl, alkyl or aryl, isoprenyl, saccharyl, 
phosphorous-containing, sulfur-containing, or selenium-containing groups, as well as small 
enzymatic groups such as Coenzyme A. 
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Acyl transferases include peroxisomal carnitine octanoyl transferase, which is involved in the 
fatty acid beta-oxidation pathway, and mitochondrial carnitine palmitoyl transferases, involved in 
fatty acid metabolism and transport Choline O-acetyl transferase catalyzes the biosynthesis of the 
neurotransmitter acetylcholine. 
5 Amino transferases play key roles in protein synthesis and degradation, and they contribute to 

other processes as well. For example, the amino transferase 5-aminolevulinic acid synthase catalyzes 
, the addition of succinyl-CoA to glycine, the first step in heme biosynthesis. Other amino transferases 

participate in pathways important for neurological function and metabolism. For example, glutamine- 
phenylpyruvate amino transferase, also known as glutamine transaminase K (GTK), catalyzes several 

1 o reactions with a pyridoxal phosphate cof actor. GTK catalyzes the reversible conversion of L- 

glutamine and phenylpyruvate to 2-oxoglutaramate and L-phenylalanine. Other amino acid substrates 
for GTK include L-methionine, L-histidine, and L-tyrosine. GTK also catalyzes the conversion of 
kynurenine to kynurenic acid, a tryptophan metabolite that is an antagonist of the N-methyl-D- 
< aspartate (NMDA) receptor in the brain and may exert a neuromodulator functioa Alteration of the 

1 15 kynurenine metabolic pathway may be associated with several neurological disorders. GTK also 
plays a role in the metabolism of halogenated xenobiotics conjugated to glutathione, leading to 
nephrotoxicity in rats and neurotoxicity in humans. GTK is expressed in kidney, liver, and brain. 
Both human and rat GTKs contain a putative pyridoxal phosphate binding site (ExPASy ENZYME: 
EC 2.6.1.64; Peny, S.J. et al. (1993) Mol. Pharmacol. 43:660-665; Perry, S. et al. (1995) FEBS Lett 

20 360:277-280; and Alberati-Giani, D. et al. (1995) J. Neurochem. 64:1448-1455). A second amino 
transferase associated with this pathway is kynurenine/a-aminoadipate amino transferase (AadAT). 
AadAT catalyzes the reversible conversion of a-aminoadipate and a-ketoglutarate to a-ketoadipate 
and L-glutamate during lysine metabolism. AadAT also catalyzes the transamination of kynurenine 
to kynurenic acid. A cytosolic AadAT is expressed in rat kidney, liver, and brain (Nakatani, Y. et al. 

25 (1970) Biochim. Biophys. Acta 198:219-228; Buchli, R. et al. (1995) J. Biol. Chem. 270:29330- 
29335). 

Glycosyl transferases include the mammalian UDP-glucouronosyl transferases, a family of 
membrane-bound microsomal enzymes catalyzing the transfer of glucouronic acid to lipophilic 
substrates in reactions that play important roles in detoxification and excretion of drugs, carcinogens, 
30 and other foreign substances. Another mammalian glycosyl transferase, mammalian UDP-galactose- 
ceramide galactosyl transferase, catalyzes the transfer of galactose to ceramide in the synthesis of 
galactocerebrosides in myelin membranes of the nervous system. The UDP-glycosyl transferases 
share a conserved signature domain of about 50 amino acid residues (PROSITE: PDOC00359, 
htQ)://expasy.hcuge.ch/sprot/prositeJitml). 
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Methyl transferases are involved in a variety of pharmacologically important processes. 
Nicotinamide N-methyl transferase catalyzes the N-methylation of nicotinamides and other pyridines, 
an important step in the cellular handling of drugs and other foreign compounds. 
Phenylethanolamine N-methyl transferase catalyzes the conversion of noradrenalin to adrenalin. 6-6- 
5 methylguanine-DNA methyl transferase reverses DNA methylation, an important step in 

carcinogenesis. Uropoiphyrin-in C-methyl transferase, which catalyzes the transfer of two methyl 
groups from S-adenosyl-L-methionine to uroporphyrinogen HI, is the first specific enzyme in the 
biosynthesis of cobalamin, a dietary enzyme whose uptake is deficient in pernicious anemia. Protein- 
arginine methyl transferases catalyze the postradiational methylation of arginine residues in 

10 proteins, resulting in the mono- and dimethylation of arginine on the guanidino group. Substrates 
include histones, myelin basic protein, and heterogeneous nuclear ribonucleoproteins involved in 
mRNA processing, splicing, and transport Protein-arginine methyl transferase interacts with proteins 
upregulated by mitogens, with proteins involved in chronic lymphocytic leukemia, and with 

' interferon, suggesting an important role for methylation in cytokine receptor signaling (Lin, W.-J. et 

15 al. (1996) J. Biol. Chem. 271:15034-15044; Abramovich, C. et al. (1997) EMBO J. 16:260-266; and 
Scott, H.S. et al. (1998) Genomics 48:330-340). 

Phosphotransferases catalyze the transfer of high-energy phosphate groups and are important 
in energy-requiring and -releasing reactions. The metabolic enzyme creatine kinase catalyzes the 
reversible phosphate transfer between creatine/creatine phosphate and ATP/ADP. Glycocyamine 

2 o kinase catalyzes phosphate transfer from ATP to guanidoacetate, and arginine kinase catalyzes 

phosphate transfer from ATP to arginine. A cysteine-containing active site is conserved in this family 
(PROS1TE: PDOC00103). 

Prenyl transferases are heterodimers, consisting of an alpha and a beta subunit, that catalyze 
the transfer of an isoprenyl group. An example of a prenyl transferase is the mammalian protein 

2 5 f arnesyl transferase. The alpha subunit of f arnesyl transferase consists of 5 repeats of 34 amino acids 
each, with each repeat containing an invariant tryptophan (PROSITE: PDOC00703). 

Saccharyl transferases are glycating enzymes involved in a variety of metabolic processes. 
Oligosacchryl transferase-48, for example, is a receptor for advanced glycation endproducts. 
Accumulation of these endproducts is observed in vascular complications of diabetes, macrovascular 

30 disease, renal insufficiency, and Alzheimer's disease (Thornalley, P.J. (1998) Cell Mol. Biol. (Noisy- 
Le-Grand) 44:1013-1023). 

Coenzyme A (CoA) transferase catalyzes the transfer of CoA between two carboxylic acids. 
Succinyl CoA'3-oxoacid CoA transferase, for example, transfers CoA from succinyl-CoA to a 
recipient such as acetoacetate. Acetoacetate is essential to the metabolism of ketone bodies, which 
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accumulate in tissues affected by metabolic disorders such as diabetes (PROSITE: PDOC00980). 
Hydrolases 

Hydrolysis is the breaking of a covalent bond in a substrate by introduction of a molecule of 
water. The reaction involves a nucleophilic attack by the water molecule's oxygen atom on a target 
5 . bond in the substrate. The water molecule is split across the target bond, breaking the bond and 
generating two product molecules. Hydrolases participate in reactions essential to.such functions as 
synthesis and degradation of cell components, and for regulation of cell functions including cell 
signaling, cell proliferation, inflamation, apoptosis, secretionand excretion. Hydrolases are involved 
in key steps in disease processes involving these functions. Hydrolytic enzymes, or hydrolases, may 

10 be grouped by substrate specificity into classes including phosphatases, peptidases, 
lysophospholipases, phosphodiesterases, glycosidases, and glyoxalases. 

Phosphatases,hydrolytically remove phosphate groups from proteins, an energy-providing 
step that regulates many cellular processes, including intracellular signaling pathways that in turn 
control cell growth and differentiation, cell-cell contact, the cell cycle, and oncogenesis. 

15 Lysophospholipases (LPLs) regulate intracellular lipids by catalyzing the hydrolysis of ester 

bonds to remove an acyl group, a key step in lipid degradation. Small LPL isoforms, approximately 
15-30 kD, function as hydrolases; larger isoforms function both as hydrolases and transacylases. A 
particular substrate for LPLs, lysophosphatidylcholine, causes lysis of cell membranes. LPL activity 
is regulated by signaling molecules important in numerous pathways, including the inflammatory 

20 response. 

Peptidases, also called proteases, cleave peptide bonds that form the backbone of peptide or 
protein chains. Proteolytic processing is essential to cell growth, differentiation, remodeling, and 
homeostasis as well as inflammation and immune response. Since typical protein half-lives range 
from hours to a few days, peptidases are continually cleaving precursor proteins to their active form, 

2 5 removing signal sequences from targeted proteins, and degrading aged or defective proteins. 

Peptidases function in bacterial, parasitic, and viral invasion and replication within a host. Examples 
of peptidases include trypsin and chymotrypsin (components of the complement cascade and the 
blood-clotting cascade) lysosomal cathepsins, calpains, pepsin, renin, and chymosin (Beynon, R. J. 
and J.S. Bond (1994) Proteolytic Enzymes: A Practical Approach, Oxford University Press, New 

30 York NY, pp. 1-5). 

The phosphodiesterases catalyze the hydrolysis of one of the two ester bonds in a 
phosphodiester compound. Phosphodiesterases are therefore crucial to a variety of cellular processes. 
Phosphodiesterases include DNA and RNA endo- and exo-nucleases, which are essential to cell 
growth and replication as well as protein synthesis. Another phosphodiesterase is acid 

10 
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sphingomyelinase, which hydrolyzes the membrane phospholipid sphingomyelin to ceramide and 
phosphorylcholine. Phos£horylcholine is used in the synthesis of phosphatidylcholine, which is 
involved in numerous intracellular signaling pathways. Ceramide is an essential precursor for the 
generation of gangliosides, membrane lipids found in high concentration in neural tissue. Defective 
5 acid sphingomyelinase phosphodiesterase leads to a build-up of sphingomyelin molecules in 
1 lysosomes, resulting in Niemann-Pick disease. 

Glycosidases catalyze the cleavage of hemiacetyl bonds of glycosides, which are compounds 
that contain one or more sugar. Mammalian lactase-phlorizin hydrolase, for example, is an intestinal 
enzyme that splits lactose. Mammalian beta-galactosidase removes the terminal galactose from 

10 gangliosides, glycoproteins, and glycosaminoglycans, and deficiency of this enzyme is associated 
with a gangliosidosis known as Morquio disease type B. Vertebrate lysosomal alpha-glucosidase, 
which hydrolyzes glycogen, maltose, and isomaltose, and vertebrate intestinal sucrase-isomaltase, 
which hydrolyzes sucrose, maltose, and isomaltose, are widely distributed members of this family 
with highly conserved sequences at their active sites. 

1 5 The glyoxylase system is involved in gluconeogenesis, the production of glucose from 

storage compounds in the body. It consists of glyoxylase I, which catalyzes the formation of S-D- 
lactoylglutathione from methyglyoxal, a side product of triose-phosphate energy metabolism, and 
glyoxylase II, which hydrolyzes S-D-lactoylglutathione to D-lactic acid and reduced glutathione. 
Glyoxylases are involved in hyperglycemia, non-insulin-dependent diabetes mellitus, the 

20 detoxification of bacterial toxins, and in the control of cell proliferation and microtubule assembly. 
Lyases 

Lyases are a class of enzymes that catalyze the cleavage of C-C, C-O, C-N, C-S, C-(halide), 

P-O or other bonds without hydrolysis or oxidation to form two molecules, at least one of which 

i 

contains a double bond (Stryer, L. (1995) Biochemistry W.H. Freeman and Co. New York, NY 
25 p.620). Lyases are critical components of cellular biochemistry with roles in metabolic energy 

production including fatty acid metabolism, as well as other diverse enzymatic processes. Further 

classification of lyases reflects the type of bond cleaved as well as the nature of the cleaved group. 
The group of C-C lyases include carboxyl-lyases (decarboxylases), aldehyde-lyases 

(aldolases), oxo-acid-lyases and others. The C-O lyase group includes hydro-lyases, lyases acting on 
30 polysaccharides and other lyases. Hie C-N lyase group includes ammonia-lyases, amidine-lyases, 

amine-lyases (deaminases) and other lyases. 

Proper regulation of lyases is critical to normal physiology. For example, mutation induced 

deficiencies in the uroporphyrinogen decarboxylase can lead to photosensitive cutaneous lesions in 

the genetically-linked disorder familial porphyria cutanea tarda (Mendez, M. et al. (1998) Am. J. 

11 
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Genet 63:1363-1375). It has also been shown that adenosine deaminase (ADA) deficiency stems 
from genetic mutations in the ADA gene, resulting in the disorder severe combined 
immunodeficiency disease (SCID) (Hershfield, M.S. (1998) Semin. Hematol. ,35:291-298). 
Isomerases 

5 Isomerases are a class of enzymes that catalyze geometric or structural changes within a 

molecule to form a single product This class includes racemases and epimerases, cis-trans- 
isomerases, intramolecular oxidoreductases, intramolecular transferases (mutases) and intramolecular 
lyases. Isomerases are critical components of cellular biochemistry with roles in metabolic energy 
production including glycolysis, as well as other diverse enzymatic processes (Stryer, L. (1995) 
10 Biochemistry. W.H. Freeman and Co., New York NY, pp.483-507) . 

Racemases are a subset of isomerases that catalyze inversion of a molecules configuration 
. around the asymmetrip carbon atom in a substrate having a single center of asymmetry, thereby 
interconverting two racemers. Epimerases are another subset of isomerases that catalyze inversion of 
configuration around an asymmetric carbon atom in a substrate with more than one cento of 
is symmetry, thereby interconverting two epimers. Racemases and epimerases can act on amino acids 
and derivatives, hydroxy acids and derivatives, as well as carbohydrates and derivatives. The 
. interconversion of UDP-galactose and UDP-glucose is catalyzed by UDP-galactose-4' -epimerase. 
Proper regulation and function of this epimerase is essential to the synthesis of glycoproteins and 
glycolipids. Elevated blood galactose levels have been correlated with UDP-galactose-4* -epimerase 
20 deficiency in screening programs of infants (Gitzelmann, R. (1972) Helv. Paediat Acta 27:125-130). 

Oxidoreductases can be isomerases as well. Oxidoreductases catalyze the reversible transfer 
of electrons from a substrate that becomes oxidized to a substrate that becomes reduced. This class of 
enzymes includes dehydrogenases, hydroxylases, oxidases, oxygenases, peroxidases, and reductases. 
Proper maintenance of oxidoreductase levels is physiologically important For example, genetically- 

2 5 linked deficiencies in lipoamide dehydrogenase can result in lactic acidosis (Robinson, B.H. et al. 

(1977) Pediat Res. 11:1198-1202). 

Another subgroup of isomerases are the transferases (or mutases). Transferases transfer a 
chemical group from one compound (the donor) to another compound (the acceptor). The types of 
groups transferred by these enzymes include acyl groups, amino groups, phosphate groups 

3 o (phosphotransferases or phosphomutases), and others. The transferase carnitine palmitoyltransferase 

is an important component of fatty acid metabolism. Genetically-linked deficiencies in this 
transferase can lead to myopathy (Scriver, C.R. et al. (1995) The Metabolic and Molecular Basis of 
Inherited Disease . McGraw-Hill, New York NY, pp.1501-1533). 

Yet another subgroup of isomerases are the topoisomersases. Topoisomerases are enzymes 

12 



BNSDOCJDi <WO 01 62927 A2_L> 



WO 01/62927 PCT/US01/06059 

that affect the topological state of DNA. For example, defects in topoisomerases or their regulation 
can affect normal physiology. Reduced levels of topoisomerase II have been correlated with some of 
the DNA processing defects associated with the disorder ataxia-telangiectasia. (Singh, S.P. et al. 
(1988) Nucleic Acids Res. 16:3919-3929). 
5 Lipases 

Ligases catalyze the formation of a bond between two substrate molecules. The process 
involves the hydrolysis of a pyrophosphate bond in ATP or a similar energy donor. Ligases are 
classified based on the nature of the type of bond they form, which can include carbon-oxygen, 
carbon-sulfur, carbon-nitrogen, carbon-carbon and phosphoric ester bonds. 

10 Ligases forming carbon-oxygen bonds include the aminoacyl-transfer RNA (tRNA) 

synthetases which are Important RNA-associated enzymes with roles in translation. Protein 
biosynthesis depends on each amino acid forming a linkage with the appropriate tRNA. The 
aminoacyl-tRNA synthetases are responsible for the activation and correct attachment of an amino 
acid with its cognate tRNA. The 20 aminoacyl-tRNA synthetase enzymes can be divided into two 

15 structural classes, and each class is characterized by a distinctive topology of the catalytic domain. 
Class I enzymes contain a catalytic domain based on the nucleotide-binding Rossman fold. Class II 
enzymes contain a central catalytic domain, which consists of a seven-stranded antiparallel 6-sheet 
motif, as well as N- and C- terminal regulatory domains. Class II enzymes are separated into two 
groups based on the heterodimeric or homodimeric structure of the enzyme; the latter group is further 

20 subdivided by the structure of the N- and C-terminal regulatory domains (Harflein, M. and S. Gusack 
(1995) J. Mol. Evol. 40:519-530). Autoantibodies against aminoacyl-tRNAs are generated by 
patients with dermatomyositis and polymyositis, and correlate strongly with complicating interstitial 
lung disease (ILD). These antibodies appear to be generated in response to viral infection, and 
coxsackie virus has been used to induce experimental viral myositis in animals. 

2 5 Ligases forming carbon-sulfur bonds (Acid-thiol ligases) mediate a large number of cellular 

biosynthetic intermediary metabolism processes involve intermolecular transfer of carbon 
atom-containing substrates (carbon substrates). Examples of such reactions include the tricarboxylic 
acid cycle, synthesis of fatty acids and long-chain phospholipids, synthesis of alcohols and aldehydes, 
synthesis of intermediary metabolites, and reactions involved in the amino acid degradation 

3 o pathways. Some of these reactions require input of energy, usually in the form of conversion of ATP 

to either ADP or AMP and pyrophosphate. 

In many cases, a carbon substrate is derived from a small molecule containing at least two 
carbon atoms. The carbon substrate is often covalently bound to a larger molecule which acts as a 
carbon substrate carrier molecule within the cell. In the biosynthetic mechanisms described above, 

13 
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the carrier molecule is coenzyme A. Coenzyme A (CoA) is structurally related to derivatives of the 
nucleotide ADP and consists of 4'-phosphopantetheine linked via a phosphodiester bond to the alpha 
phosphate group of adenosine 3\5'-bisphosphate. The terminal thiol group of 4'-phosphopantetheine 
acts as the site for carbon substrate bond formation. The predominant carbon substrates which utilize 
5 CoA as a carrier molecule during biosynthesis and intermediary metabolism in the cell are acetyl, 
succinyl, and propionyl moieties, collectively referred, to as acyl groups. Other carbon substrates 
include enoyl lipid, which acts as a fatty acid oxidation intermediate, and carnitine, which acts as an 
acetyl-CoA flux regulator/ mitochondrial acyl group transfer protein. Acyl-CoA and acetyl-CoA are 
synthesized in the cell by acyl-CoA synthetase and acetyl-CoA synthetase, respectively. 

10, Activation of fatty acids is mediated by at least three forms of acyl-CoA synthetase activity: 

i) acetyl-CoA synthetase, which activates acetate and several other low molecular weight carboxylic 
acids and is found in muscle mitochondria and the cytosol of other tissues; ii) medium-chain 
acyl-CoA synthetase, which activates fatty acids containing between four and eleven carbon atoms 
(predominantly from dietary sources), and is present only in liver mitochondria; and iii) acyl CoA 

15 synthetase, which is specific for long chain fatty acids with between six and twenty carbon atoms, and 
is found in microsomes and the mitochondria. Proteins associated with acyl-CoA synthetase activity 
have been identified from many sources including bacteria, yeast, plants, mouse, and man. The 
activity of acyl-CoA synthetase may be modulated by phosphorylation of the enzyme by 
cAMP-dependent protein kinase. 

2 o Ligases forming carbon-nitrogen bonds include amide synthases such as glutamine synthetase 

(glutamate-ammonia ligase) that catalyzes the amination of glutamic acid to glutamine by ammonia 
using the energy of ATP hydrolysis. Glutamine is the primary source for the amino group in various 
amide transfer reactions involved in de novo pyrimidine nucleotide synthesis and in purine and 

pyrimidine ribonucleotide interconversions. Overexpression of glutamine synthetase has- been 

i 

2 5 observed in primary liver cancer (Christa, L et al. (1994) Gastroent 106: 1312-1 320). 

Acid-amino-acid ligases (peptide synthases) are represented by the ubiquitin proteases which 
are associated with the ubiquitin conjugation system (UCS), a major pathway for the degradation of 
cellular proteins in eukaryotic cells and some bacteria. The UCS mediates the elimination of 
abnormal proteins and regulates the half-lives of important regulatory proteins that control cellular 

30 processes such as gene transcription and cell cycle progression. In the UCS pathway, proteins 
targeted for degradation are conjugated to a ubiquitin (Ub), a small heat stable protein. Ub is first 
activated by a ubiquitin-activating enzyme (El), and then transferred to one of several Ub- 
conjugating enzymes (E2). E2 then links the Ub molecule through its C-terminal glycine to an 
internal lysine (acceptor lysine) of a target protein. The ubiquitinated protein is then recognized and 
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degraded by proteasome, a large, multisubunit proteolytic enzyme complex, and ubiquitin is released 
for reutilization by ubiquitin protease. The UCS is implicated in the degradation of mitotic cyclic 
kinases, oncoproteins, tumor suppressor genes such as p53, viral proteins, cell surface receptors 
associated with signal transduction, transcriptional regulators, and mutated or damaged proteins 
5 (Ciechanover, A. (1994) Cell 79:13-21). A murine proto-oncogene, Unp, encodes a nuclear ubiquitin 
protease whose overexpression leads to oncogenic transformation of NIH3T3 cells, and the human 
homolog of this gene is consistently elevated in small cell tumors and adenocarcinomas of the lung 
(Gray, D.A. (1995) Oncogene 10:2179-2183). 

Cyclo-ligases and other carbon-nitrogen ligases comprise various enzymes and enzyme 

i o complexes that participate in the de novo pathways to purine and pyrimidine biosynthesis. Because 
these pathways are critical to the, synthesis of nucleotides for replication of both RNA and DNA, 
many of these enzymes have been the targets of clinical agents for the treatment of cell proliferative 
disorders such as cancer and infectious diseases. 

Purine biosynthesis occurs de novo from the amino acids glycine and glutamine, and other 

15 small molecules. Three of the key reactions in this process are catalyzed by a trifunctional enzyme 
composed of glycinamide-ribonucleotide synthetase (GARS), aminoimidazole ribonucleotide 
synthetase (AIRS), and glycinamide ribonucleotide transformylase (GART). Together these three 
enzymes combine ribosylamine phosphate with glycine to yield phosphoribosyl aminoimidazole, a 
precursor to both adenylate and guanylate nucleotides. This trifunctional protein has been implicated 

20 in the pathology of Downs syndrome (Aimi, J. et al. (1990) Nucleic Acid Res. 18:6665-6672). 

Adenylosuccinate synthetase catalyzes a later step in purine biosynthesis that converts inosinic add to 
adenylosuccinate, a key step on the path to ATP synthesis. Hiis enzyme is also similar to another 
carbon-nitrogen ligase, argininosuccinate synthetase, that catalyzes a similar reaction in the urea cycle 
(Powell, S.M. et al. (1992) FEES Lett. 303:4-10). 

25 Like the de novo biosynthesis of purines, de novo synthesis of the pyrimidine nucleotides 

uridylate and cytidylate also arises from a common precursor, in this instance the nucleotide 
orotidylate derived from orotate and phosphoribosyl pyrophosphate (PPRP). Again a trifunctional 
enzyme comprising three carbon-nitrogen ligases plays a key role in the process. In this case the 
enzymes aspartate transcarbamylase (ATCase), carbamyl phosphate synthetase II, and dihydroorotase 

30 (DHOase) are encoded by a single gene called CAD. Together these three enzymes combine the 
initial reactants in pyrimidine biosynthesis, glutamine, CO^ and ATP to form dihydroorotate, the 
precursor to orotate and orotidylate (Iwahana, H. et al. (1996) Biochem. Biophys. Res. Commun. 
219:249-255). Further steps then lead to the synthesis of uridine nucleotides from orotidylate. 
Cytidine nucleotides are derived from uridine-5' -triphosphate (UTP) by the amidation of UTP using 

15 
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glutamine as the amino donor and the enzyme CTP synthetase. Regulatory mutations in the human 
CTP synthetase are believed to confer multi-drug resistance to agents widely used in cancer therapy 
(Yamauchi, ,M. et al. (1990) EMBO J. 9:2095-2099). 

LigaseS forming carbon-carbon bonds include the carboxylases acetyl-CoA carboxylase and 
5 pyruvate carboxylase. Acetyl-CoA carboxylase catalyzes the carboxylation of acetyl-CoA from COj 
and Kfi using the energy of ATP hydrolysis. Acetyl-CoA carboxylase is the rate-limiting step in the 
biogenesis of long-chain fatty acids. Two isoforms of acetyl-CoA carboxylase, types I and types n, 
are expressed inhuman in a tissue-specific manner (Ha, J. et al. (1994) Eur. J. Biochem. 219:297- 

306}. Pyruvate carboxylase is a nuclear-encoded mitochondrial enzyme that catalyzes the conversion 

i 

10 of pyruvate to oxaloacetate, a key intermediate in the citric acid cycle. 

Ligases forming phosphoric ester bonds include the DNA ligases involved in both DNA 
replication and repair, DNA ligases seal phosphodiester bonds between two adjacent nucleotides in a 
DNA chain using the energy from ATP hydrolysis to first activate the free 5*-phosphate of one 
nucleotide and then react it with the 3'-OH group of the adjacent nucleotide. This resealing reaction 
15 is used in both DNA replication to join small DNA fragments called Okazaki fragments that are 
transiently formed in the process of replicating new DNA, and in DNA repair. DNA repair is the 
process by which accidental base changes, such as those produced by oxidative damage, hydrolytic 
attack, or uncontrolled methylation of DNA, are corrected before replication or transcription of the 
DNA can occur. Bloom's syndrome is an inherited human disease in which individuals are partially 
2 o deficient in DNA ligation and consequently have an increased incidence of cancer (Alberts, B. et al. 
(1994) The Molecular Biology of the Cell . Garland Publishing Inc., New York NY, p. 247). 

Molecules Associated with Growth and Development 

Human growth and development requires the spatial and temporal regulation of cell 
differentiation, cell proliferation, and apoptosis. These processes coordinately control reproduction, 
aging, embryogenesis, morphogenesis, organogenesis, and tissue repair and maintenance. At the 
cellular level, growth and development is governed by the cell's decision to enter into or exit from the 
cell division cycle and by the cell's commitment to a terminally differentiated state. These decisions 
are made by the cell in response to extracellular signals and other environmental cues it receives. The 
following discussion focuses on the molecular mechanisms of cell division, reproduction, cell 
differentiation and proliferation, apoptosis, and aging. 
Cell Division 

Cell division is the fundamental process by which all living things grow and reproduce. In 
unicellular organisms such as yeast and bacteria, each cell division doubles the number of organisms, 

16 

BNSDOCID: <WO 01 62927 A2 | > 



25 



30 



WO 01/62927 PCT/US01/06059 

while in multicellular species many rounds of cell division are required to replace cells lost by wear or 
by programmed cell death, and for cell differentiation to produce a new tissue or organ. Details of the 
cell division cycle may vary, but the basic process consists of three principle events. The first event, 
interphase, involves preparations for cell division, replication of the DNA, and production of essential 
5 proteins. In the second event, mitosis, the nuclear material is divided and separates to opposite sides of 
the cell. The final event, cytokinesis, is division and fission of the cell cytoplasm. The sequence and 
timing of cell cycle transitions is under the control of the cell cycle regulation system which controls the 
process by positive or negative regulatory circuits at various check points. 

Regulated progression of the cell cycle depends on the integration of growth control pathways 

10 with the basic cell cycle machinery. Cell cycle regulators have been identified by selecting for human 
and yeast cDNAs that block or activate cell cycle arrest signals in the yeast mating pherombne pathway 
when they are overexpressed Known regulators include human CPR (cell cycle progression 
restoration) genes, such as CPR8 and CPR2, and yeast CDC (cell division control) genes, including 
CDC91, that block the arrest signals. The CPR genes express a variety of proteins including cyclins, 

15 tumor suppressor binding proteins, chaperones, transcription factors, translation factors, and 
RNA-binding proteins (Edwards, M.C. et al.(1997) Genetics 147:1063-1076). 

Several cell cycle transitions, including the entry and exit of a cell from mitosis, are dependent 
upon the activation and inhibition of cychn-dependent kinases (Cdks). The Cdks are composed of a 
kinase subunit, Cdk, and an activating subunit, cyclin, in a complex that is subject to many levels of 

20 regulation. There appears to be a single Cdk in Saccharomvces cerevisiae and Saccharomyces pombe 
whereas mammals have a variety of specialized Cdks. Cyclins act by binding to and activating 
cyclin-dependent protein kinases which then phosphorylate and activate selected proteins involved in the 
mitotic process. The Cdk-cyclin complex is both positively and negatively regulated by 
phosphorylation, and by targeted degradation involving molecules such as CDC4 and CDC53. In 

2 5 addition, Cdks are further regulated by binding to inhibitors and other proteins -such as Sucl that 

modify their specificity or accessibility to regulators (Patra, D. and W.G. Dunphy (1996) Genes Dev. 

10:1503-1515; and Mathias, N. et al. (1996) Mol. Cell Biol. 16:6634-6643). 

Reproduction 

The male and female reproductive systems are complex and involve many aspects of growth 

3 o and development The anatomy and physiology of the male and female reproductive systems are 

reviewed in (Guyton, A.C. (1991) Textbook of Medical Physiology. W.B. Saunders Co., Philadelphia 
PA, pp. 899-928). 

The male reproductive system includes the process of spermatogenesis, in which the sperm are 
formed, and male reproductive functions are regulated by various hormones and their -effects on 
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accessory sexual organs, cellular metabolism, growth, and otto bodily functions,. 

Spermatogenesis begins at puberty as a result of stimulation by gonadotropic hormones 
released from the anterior pituitary. Immature sperm (spermatogonia) undergo several mitotic cell 
divisions before undergoing meiosis and full maturation. The testes secrete several male sec hormones, 
5 the most abundant being testosterone, that is essential for growth and division of the immature sperm, 
and for the masculine characteristics of the male body., Three other male sex hormones, gonadotropih- 
releasing hormone (GnRH), luteinizing hormone (LH), and follicle-stimulating hormone (FSH) control 
sexual function. 

The uterus, ovaries, fallopian tubes, vagina, and breasts comprise the female reproductive 

1 o system. The ovaries and uterus are the source of ova and the location of fetal development, 

respectively. The fallopian tubes and vagina are accessory organs attached to the top and bottom of the 
uterus, respectively. Both the uterus and ovaries have additional roles in the development and loss of 
reproductive capability during a female's lifetime. The primary role of the breasts is lactation. 
Multiple endocrine signals from the ovaries, uterus, pituitary, hypothalamus, adrenal gjands, and other 
15 , tissues coordinate reproduction and lactation. These signals vary during the monthly menstruation 
cycle and during the female's lifetime. Similarly, the sensitivity of reproductive organs to these 
endocrine signals varies during the female's lifetime. 
• A combination of positive and negative feedback to the ovaries, pituitary and hypothalamus 

glands controls physiologic changes during the monthly ovulation and endometrial cycles. The anterior 

2 o pituitary secretes two major gonadotropin hormones, follicle-stimulating hormone (FSH) and luteinizing 

hormone (LH), regulated by negative feedback of steroids, most notably by ovarian estradiol. If 
fertilization does not occur, estrogen and progesterone levels decrease. This sudden reduction of the 
ovarian hormones leads to menstruation, the desquamation of the endometrium. 

Hormones further govern all the steps of pregnancy, parturition, lactation, and menopause. 

2 5 During pregnancy large quantities of human chorionic gonadotropin (hCG), estrogens, progesterone, 

and human chorionic somatomammotropin (hCS) are formed by the placenta. hCG, a glycoprotein 
similar to luteinizing hormone, stimulates the corpus luteum to continue producing more progesterone 
and estrogens, rather than to involute as occurs if the ovum is not fertilized. hCS is similar to growth 
hormone and is crucial for fetal nutrition. 

3 o The female breast also matures during pregnancy. Large amounts of estrogen secreted by the 

placenta trigger growth and branching of the breast milk ductal system while lactation is initiated by the 
secretion of prolactin by the pituitary gland. 

Parturition involves several hormonal changes that increase uterine contractility toward the end 
of pregnancy, as follows. The levels of estrogens increase more than those of progesterone. Oxytocin 

18 

BNSDOCID: <WO 01 62927A2J_> 



WO 01/62927 PCT/US01/06059 

is secreted by the neurohypophysis. Concomitantly, uterine sensitivity to oxytocin increases. The fetus 
itself secrete? oxytocin, Cortisol (from adrenal glands), and prostaglandins. 

Menop&use occurs when most of the ovarian follicles have degenerated. The ovary thai 
produces less estradiol, reducing the negative feedback on the pituitary and hypothalamus glands. 
5 Mean levels of circulating FSH and LH increase, even as ovulatory cycles continue., Therefore, the 
ovary is less responsive to gonadotropins, and there is an increase iii the time between menstrual cycles. 
Consequently, menstrual bleeding ceases and reproductive capability ends. 
Cell Differentiation and Proliferation 

Tissue growth involves complex and ordered patterns of cell proliferation; cell differentiation, 

10 and apoptosis. Cell proliferation must be regulated to maintain both the number of cells and their 

spatial organization. This regulation depends upon the appropriate expression of proteins which control 
cell cycle progression in response to extracellular signals, such as growth factors and other mitogens, 
and intracellular cues, such as DNA damage or nutrient starvation. Molecules which directly or 
indirectly modulate cell cycle progression fall into several categories, including growth factors and their 

15 receptors, second messenger and signal transduction proteins, oncogene products, tumor-suppressor 
proteins, and mitosis-promoting factors. 1 

Growth factors were originally described as serum factors required to promote cell 
proliferation. Most growth factors are large, secreted polypeptides that act on cells in their local 
environment. Growth factors bind to and activate specific cell surface receptors and initiate 

20 intracellular signal transduction cascades. Many growth factor receptors are classified as receptor 
tyrosine kinases which undergo autophosphorylation upon ligand binding. Autophosphorylation 
enables the receptor to interact with signal transduction proteins characterized by the presence of SH2 
or SH3 domains (Src homology regions 2 or 3). These proteins then modulate the activity state of 
small G-proteins, such as Ras, Rab, and Rho, along with GTPase activating proteins (GAPs), guanine 

2 5 nucleotide releasing proteins (GNRPs), and other guanine nucleotide exchange factors. Small G 

proteins act as molecular switches that activate other downstream events, such as mitogen-activated 
protein kinase (MAP kinase) cascades. MAP kinases ultimately activate transcription of mitosis- 
promoting genes. 

In addition to growth factors, small signaling peptides and hormones also influence cell 

3 o proliferation. These molecules bind primarily to another class of receptor, the trimeric G-protdn 

coupled receptor (GPCR), found predominantly on the surface of immune, neuronal and neuroendocrine 
cells. Upon ligand binding, the GPCR activates a trimeric G protein which in turn triggers increased 
levels of intracellular second messengers such as phospholipase C, Ca2+, and cyclic AMP. Most 
GPCR-mediated signaling pathways indirectly promote cell proliferation by -causing the secretion or 
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» breakdown of other signaling molecules that have direct mitogenic effects. These signaling cascades 

often involve activation of kinases and phosphatases. Some growth factors, such as some members of 
the transforming growth factor beta (TGF-P) family, act on some cells to stimulate cell proliferation 
and on other cells to inhibit it Growth factors may also stimulate a cell at one concentration and inhibit 
5 the same ceil at another concentration. Most growth factors also have a multitude of other actions 
besides the regulation of cell growth and division: they can control the proliferation, survival, 
differentiation, migration, or function of cells depending on the circumstance. For example, the tumor 
necrosis factor/nerve growth factor (TNF/NGF) family can activate or inhibit cell death, as well as 
regulate proliferation and differentiation. The cell response depends on the type of cell, its stage of 
10 differentiation and transformation status, which surface receptors are stimulated, and the types of 

stimuli acting on the cell (Smith, A. et al. (1994) Cell 76:959-962; and Nocentini, G. et al. (1997) Proc. 
1 Natl. Acad Sci. USA 94:6216-6221). 

Neighboring cells in a tissue compete for growth factors, and when provided with "unlimited" 
quantities in a perfused system will grow to even higher cell densities before reaching density-dependent 
15 , inhibition of cell divisioa Cells often demonstrate an anchorage dependence of cell division as well. 
This anchorage dependence may be associated with the formation of focal contacts linking the 
cytoskeleton with the extracellular matrix (ECM). The expression of ECM components can be 
1 stimulated by growth factors. For example, TGF-P stimulates fibroblasts to produce a variety of ECM 

proteins, including fibronectin, collagen, and tenascin (Pearson, C.A. et al. (1988) EMBO J. 7:2677- 
2 o 298 1). In fact, for some cell types specific ECM molecules, such as laminin or fibronectin, may act as 
growth factors. Tenascin-C and -R, expressed in developing and lesioned neural tissue, provide 
stimulatory/anti-adhesive or inhibitory properties, respectively, for axonal growth (Faissner, A. (1997) 
Cell Tissue Res. 290:331-341). 

Cancers are associated with the activation of oncogenes which are derived from normal cellular 
25 genes. These oncogenes encode oncoproteins which convert normal cells into malignant cells. Some 
oncoproteins are mutant isoforms of the normal protein, and other oncoproteins are abnormally 
expressed with respect to location or amount of expression. The latter category of oncoprotein causes 
cancer by altering transcriptional control of cell proliferatioa Five classes of oncoproteins are known 
to affect cell cycle controls. These classes include growth factors, growth factor receptors, intracellular 
30 signal transducers, nuclear transcription factors, and cell-cycle control proteins. Viral oncogenes are 
integrated into the human genome after infection of human cells by certain viruses. Examples of viral 
oncogenes include v-src, v-abl, and v-fps. 

Many oncogenes have been identified and characterized. These include sis, erbA, erbB, her-2, 
mutated G 8 , src, abl, ras, crk, jun, fos, myc, and mutated tumor-suppressor genes such as RB, p53, 
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mdm2, Cipl, pi 6, and cyclin D. Transformation of normal genes to oncogenes may also occur by 
chromosomal translocation The Philadelphia chromosome, characteristic of chronic myeloid leukemia 
and a subset of acute lymphoblastic leukemias, results from a reciprocal translocation between 
chromosomes 9 and 22 that moves a truncated portion of the proto-oncogene c-abl to the breakpoint 
5 cluster region (bcr) on chromosome 22. 

Tumor-suppressor genes are involved in regulating cell proliferation. Mutations which cause 
reduced or loss of function in tumor-suppressor genes result in uncontrolled cell proliferation. Far 
example, the retinoblastoma gene product (RB), in a non-phosphorylated state, binds several early- 
response genes and suppresses their transcription, thus blocking cell divisioa Phosphorylation of RB 
10 causes it to dissociate from the genes, releasing the suppression, and allowing cell division to proceed. 
Apoptosis 

Apoptosis is the genetically controlled process by which unneeded or defective cells undergo 
programmed cell death. Selective elimination of cells is as important for morphogenesis and tissue 

i 

remodeling as is cell proliferation and differentiation. Lack of apoptosis may result in hyperplasia and 
15 other disorders associated with increased cell proliferation. Apoptosis is also a critical component of 
the immune response. Immune cells such as cytotoxic T-cells and natural killer cells prevent the spread 
of disease by inducing apoptosis in tumor cells and virus-infected cells. In addition, immune cells that 
fail to distinguish self molecules from foreign molecules must be eliminated by apoptosis to avoid an 
autoimmune response. 

20 Apoptotic cells undergo distinct morphological changes. Hallmarks of apoptosis include cell 

shrinkage, nuclear and cytoplasmic condensation, and alterations in plasma membrane topology. 
Biochemically, apoptotic cells are characterized by increased intracellular calcium concentration, 
fragmentation of chromosomal DNA, and expression of novel cell surface components. 

The molecular mechanisms of apoptosis are highly conserved, and many of the key protein 

2 5 regulators and effectors of apoptosis have been identified. Apoptosis generally proceeds in response to 

a signal which is transduced intracellular^ and results in altered patterns of gene expression and protein 
activity. Signaling molecules such as hormones and cytokines are known both to stimulate and to 
inhibit apoptosis through interactions with cell surface receptors. Transcription factors also play an 
important role in the onset of apoptosis. A number of downstream effector molecules, particularly 

3 o proteases such as the cysteine proteases called caspases, have been implicated in the degradation of 

cellular components and the proteolytic activation of other apoptotic effectors. 
Aging and Senescence 

Studies of the aging process or senescence have shown a number of characteristic cellular and 
molecular changes (Fauci et al. (1998) Harrison's Principles of Internal Medicine . McGraw-Hill, New 
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York NY, p.37). These characteristics include increases in chromosome structural abnormalities, DNA 
cross-linking, incidence of single-stranded breaks in DNA, losses in DNA methylation, and degradation 
of telomere regions. In addition to these DNA changes, post-translational alterations of proteins 
increase including, deamidation, oxidation, cross-linking, and nonenzymatic glycation. Still farther 
5 molecular changes occur in the mitochondria of aging cells through deterioration of structure. These 
changes eventually contribute to .decreased function in every organ of the body. 

Biochemical Pathway Molecules 

Biochemical pathways are responsible for regulating metabolism, growth and development, 
10 protein secretion and trafficking, environmental responses, and ecological interactions including 
immune response and response to parasites. 
DNA replication : 1 

Deoxyribonucleic acid (DNA), the genetic material, is found in both the nucleus and 
mitochondria of human cells. The bulk of human DNA is nuclear, in the form of linear chromosomes, 
15 while mitochondrial DNA is circular. DNA replication begins at specific sites called origins of 

replicatioa Bidirectional synthesis occurs from the origin via two growing forks that move in opposite 
directions. Replication is semi-conservative, with each daughter duplex containing one old strand and 
its newly synthesized complementary partner. Proteins involved in DNA replication include DNA 
polymerases, DNAprimase, telomerase, DNAhelicase, topoisomerases, DNAligases, replication 
20 factors, and DNA-binding proteins. 
. DNA Recombination and Repair 

Cells are constantly faced with replication errors and environmental assault (such as ultraviolet 
irradiation) that can produce DNA damage. Damage to DNA consists of any change that modifies the 
structure of the molecule. Changes to DNA can be divided into two general classes, single base 

2 5 changes and structural distortions. Any damage to DNA can produce a mutation, and the mutation may 

produce a disorder, such as cancer. 

Changes in DNA are recognized by repair systems within the cell. These repair systems act to 
correct the damage and thus prevent any deleterious affects of a mutational event Repair systems can 
be divided into three general types, direct repair, excision repair, and retrieval systems. Proteins 

3 o involved in DNA repair include DNA polymerase, excision repair proteins, excision and cross link 

repair proteins, recombination and repair proteins, RAD51 proteins, and BLN and WRN proteins that 
are homologs of RecQ helicase. When the repair systems are eliminated, cells become exceedingly 
sensitive to environmental mutagens, such as ultraviolet irradiatioa Patients with disorders associated 
with a loss in DNA repair systems often exhibit a high sensitivity to environmental mutagens. 



22 



WO 01/62927 PCT/US01/06059 

i 

Examples of such disorders include xeroderma pigmentosum (XP), Bloom's syndrome (BS), and 
Werner's syndrome (WS) (Yamagata, K. et al. (1998) Proc. Natl. Acad ScL USA 95:8733-8738), 
ataxia telangiectasia, Cockayne's syndrome, and Fanconi's anemia. 

Recombination is the process whereby new DNA sequences are generated by the movements of 
5 large pieces of DNA. In homologous recombination, which occurs during meiosis and DNA repair, 
parent DNA duplexes align at regions of sequence similarity, and new DNA molecules form by the 
breakage and joining of homologous segments. Proteins involved include RAD5 1 recombinase. In site- 
specific recombination, two specific but not necessarily homologous DNA sequences are exchanged In 
the immune system this process generates a diverse collection of antibody arid T cell receptor genes. 
10 Proteins involved in site-specific recombination in the immune system include recombination activating 
genes 1 and 2 (RAG1 and RAG2). A defect in immune system site-specific recombination causes 
severe combined immunodeficiency disease in mice. , 
RNA Metabolism 

Ribonucleic acid (RNA) is a linear single-stranded polymer of four nucleotides, ATP, CTP, 
is UTP, and GTP. In most organisms, RNA is transcribed as a copy of DNA, the genetic material of the 
organism. In retroviruses RNA rather than DNA serves as the genetic material. RNA copies of the 
genetic material encode proteins or serve various structural, catalytic, or regulatory roles in organisms. 
RNA is classified according to its cellular localization and function. Messenger RNAs (mRNAs) 
encode polypeptides. Ribosomal RNAs (rRNAs) are assembled, along with ribosomal proteins, into 
20 ribosomes, which are cytoplasmic particles that translate mRNA into polypeptides. Transfer RNAs 
(tRNAs) are cytosolic adaptor molecules that function in mRNA translation by recognizing both an 
mRNA codon and the amino acid that matches that codon. Heterogeneous nuclear RNAs (hnRNAs) 
include mRNA precursors and other nuclear RNAs of various sizes. Small nuclear RNAs (snRNAs) 
are a part of the nuclear spliceosome complex that removes intervening, non-coding sequences (introns) 

2 5 and rejoins ©cons in pre-mRNAs. 

RNA Transcription 

The transcription process synthesizes an RNA copy of DNA Proteins involved include multi- 
subunit RNA polymerases, transcription factors OA, IIB, IID, HE, DF, IIH, and ID. Many 
transcription factors incorporate DNA-binding structural motifs which comprise either a-helices or P- 

3 o sheets that bind to the major groove of DNA. Four well-characterized structural motifs are helix-turn- 

helix, zinc finger, leucine zipper, and helix-loop-helix. 
RNA Processing 

Various proteins are necessary for processing of transcribed RNAs in the nucleus. Pre-mRNA 
processing steps include capping at the 5' end with methylguanosine, polyadenylating the 3' end, and 
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splicing to remove introns. The spliceosomal complex is comprised of five small nuclear 
ribonucleoprotein particles (snRNPs) designated Ul, U2, U4, U5, and U6. Each snRNP contains a 
single species of snRNA and about ten proteins. The RNA components of some snRNPs recognize and 
base-pair with intron consensus sequences. The protein components mediate spliceosome assembly and 
s the splicing reaction. Autoantibodies to snRNP proteins are found in the blood of patients with 
systemic lupus erythematosus (Stryer, L. (1995) Biochemistry W.H. Freeman and Company, New , 
YorkNY,p.863). 

Heterogeneous nuclear ribonucleqprotdns (hnRNPs) have been identified that have roles in 
splicing, exporting of the mature RNAs to the cytoplasm, and mRNA translation (Biamonti, G. et al. 

10 (1998) Clin. Exp. Rheumatol. 16:317-326). Some examples of hnRNPs include the yeast proteins 
Hrplp, involved in cleavage and polyadenylation at the 3' end of the RNA; Cbp80p, involved in 
capping the 5' end of the RNA; and Npl3p, a homolog of mammalian hnRNP Al, involved in export of 
mRNA from the nucleus (Shea, E.C. et al. (1998) Genes Dev. 12:679-691). HnRNPs have been shown 
to be important targets of the autoimmune response in rheumatic diseases (Biamonti, supra) . 

'15 Many snRNP proteins, hnRNP proteins, and alternative splicing factors are characterized by 

an RNA recognition motif (RRM). (Reviewed in Birney, E. et al. (1993) Nucleic Acids Res. 21:5803- 
5816.) The RRM is about 80 amino acids in length and forms four P-strands and two a-hdices 
arranged in an ct/p sandwich. The RRM contains a core RNP-1 octapeptide motif along with 
surrounding conserved sequences. 

20 RNA Stability and Degradation 

RNA helicases alter and regulate RNA conformation and secondary structure by using energy 
derived from ATP hydrolysis to destabilize and unwind RNA duplexes. The most well-characterized 
and ubiquitous family of RNA helicases is the DEAD-box family, so named for the conserved B-type 
ATP-binding motif which is diagnostic of proteins in this family. Over 40 DEAD-box helicases have 

2 5 been identified in organisms as diverse as bacteria, insects, yeast, amphibians, mammals, and plants. 

DEAD-box helicases function in diverse processes such as translation initiation, splicing, ribosome 
assembly, and RNA editing, transport, and stability. Some DEAD-box helicases play tissue- and stage- 
specific roles in spermatogenesis and embryogenesis. (Reviewed in Linda:, P. et al. (1989) Nature 
337:121-122.) 

3 o Overexpression of the DEAD-box 1 protein (DDX1) may play a role in the progression of 

neuroblastoma (Nb) and retinoblastoma (Rb) tumors. Other DEAD-box helicases have been implicated 
either directly or indirectly in ultraviolet light-induced tumors, B cell lymphoma, and myeloid 
malignancies. (Reviewed in Godbout, R. et al. (1998) J. Biol. Chem. 273:21161-21168.) 

Ribonucleases (RNases) catalyze the hydrolysis of phosphodiester bonds in RNA chains, thus 
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cleaving the RNA. For example, RNase P is a ribonucleoprotein enzyme which cleaves the 5' end of 
pre-tRNAs as part of their maturation process. RNase H digests the RNA strand of an RNA/DNA 
hybrid. Such hybrids occur in cells invaded by retroviruses, and RNase H is an important enzyme in 
the retroviral replication cycle. RNase H domains are often found as a domain associated with reverse 
5 transcriptases. RNase activity in serum and cell extracts is elevated in a variety of cancers and 
i infectious diseases (Schein, C.H. (1997) Nat Biotechnol. 15:529-536). Regulation of RNase activity is 

being investigated as a means to control tumor angiogenesis, allergic reactions, viral infection and 
replication, and fungal infections. 
Protein Translation 

10 The eukaryotic ribosome is composed of a 60S (large) subunit and a 40S (small) subunit, 

which together form the SOS ribosome. In addition to the 18S, 28S, 5S, and 5.8S rRNAs, the ribosome 
also contains more than fifty proteins. The ribosomal proteins have a prefix which denotes the subunit 
to which they belong, either L (large) or S (small). Three important sites are identified on the ribosome. 
The aminoacyl-tRNA site (A site) is where charged tRNAs (with the exception of the initiator-tRNA) 

15 bind on arrival at the ribosome. The peptidyl-tRNA site (P site) is where new peptide bonds are 
formed, as well as where the initiator tRNA binds. The exit site (E site) is where deacylated tRNAs 
bind prior to their release from the ribosome, (Translation is reviewed in Stryer, L. (1995) 
Biochemistry . W.H. Freeman and Company, New York NY, pp. 875-908; and Lodish, H. et al. (1995) 
Molecular Cell Biology . Scientific American Books, New York NY, pp. 1 19-138.) 

20 tRNA Charging 

Protein biosynthesis depends on each amino acid forming a linkage with the appropriate tRNA. 
The aminoacyl-tRNA synthetases are responsible for the activation and correct attachment of an amino 
acid with its cognate tRNA The 20 aminoacyl-tRNA synthetase enzymes can be divided into two 
structural classes, Class I and Class II. Autoantibodies against aminoacyl-tRNAs are generated by 

2 5 patients with dermatomyositis and polymyositis, and correlate strongly with complicating interstitial 

lung disease (ELD). These antibodies appear to be generated in response to viral infection, and 
coxsackie virus has been used to induce experimental viral myositis in animals. 
Translation Initiation 

Initiation of translation can be divided into three stages. The first stage brings an initiator 

3 o transfer RNA (Met-tRNA^) together with the 40S ribosomal subunit to form the 43S preinitiation 

complex. The second stage binds the 43S preinitiation complex to the mRNA, followed by migration of 
the complex to the correct AUG initiation codoa Hie third stage brings the 60S ribosomal subunit to 
the 40S subunit to generate an 80S ribosome at the initiation codon. Regulation of translation primarily 
involves the first and second stage in the initiation process (Pain, V.M. (1996) Eur. J. Biochem. 
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236:747-771). 

Several initiation factors, many of which contain multiple subunits, are involved in bringing an 
initiator tRNA and 40S ribosomal subunit together. eEF2, a guanine nucleotide binding protein, recruits 
the initiator tRNA to the 40S ribosomal subunit Only when eIF2 is bound to GTP does it associate 
5 with the initiator tRNA- eIF2B, a guanine nucleotide exchange protein, is responsible for converting 
eIF2 from the GDP-bound inactive form to the GTP-bound active form. Two other factors, elFIA and 
eIF3 bind and stabilize the 40S subunit by interacting with 18S ribosomal RNA and specific ribosomal 
structural proteins. eIF3 is also involved in association of the 40S ribosomal subunit with mRNA The 
Met-tRNAf, elFl A, eIF3, and 40S ribosomal subunit together make up the 43S preinitiation complex 
10 (Pain, supra) . 

Additional factors are required for binding of the 43S preinitiation complex to an mRNA 
molecule, and the prcxpess is regulated at several levels. eIF4F is a complex consisting of three proteins: 
eIF4E, eIF4A, and eIF4G, eIF4E recognizes and binds to the mRNA 5 -terminal m 7 GTP cap, eDF4A is 
a bidirectional RNA-dependent helicase, and eIF4G is a scaffolding polypeptide. eBF4G has three 

15 binding domains. The N-terminal third of eIF4G interacts with eBF4E, the central third interacts with 
eIF4A, and the C-terminal third interacts with eIF3 bound to the 43S preinitiation complex. Thus, 
eIF4G acts as a bridge between the 40S ribosomal subunit and the mRNA (Hentze, M.W. (1997) 
Science 275:500-501). 

The ability of eIF4F to initiate binding of the 43S preinitiation complex is regulated by 

20 structural features of the mRNA. The mRNA molecule has an untranslated region (UTR) between the 
5 ■ cap and the AUG start codoa In some mRNAs this region forms secondary structures that impede 
binding of the 43S preinitiation complex. The helicase activity of eEF4A is thought to function in 
removing this secondary structure to facilitate binding of the 43S preinitiation complex (Pain, supra) . 
Translation Elongation 

2 5 Elongation is the process whereby additional amino acids are joined to the initiator methionine 

to form the complete polypeptide chain. The elongation factors EFla, EFip y, and EF2 are involved 
in elongating the polypeptide chain following initiation. EFla is a GTP-binding protein. In EFla's 
GTP-bound form, it brings an aminoacyl-tRNA to the ribosome's A site. The amino acid attached to 
the newly arrived aminoacyl-tRNA forms a peptide bond with the initiator methionine. The GTP on 

3 o EF1 a is hydrolyzed to GDP, and EF1 a-GDP dissociates from the ribosome. EF1 p y binds EF1 a -GDP 

and induces the dissociation of GDP from EFla, allowing EFla to bind GTP and a new cycle to begin. 

As subsequent aminoacyl-tRNAs are brought to the ribosome, EF-G, another GTP-binding 
protein, catalyzes the translocation of tRNAs from the A site to the P site and finally to the E site of the 
ribosome. This allows the processivity of translation. 
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Translation Termination 

The release factor eRF carries out termination of translation. eRF recognizes stop codons in 
the mRNA, leading to the release of the polypeptide chain from the ribosome. 
Post-Translational Pathways 
5 Proteins may be modified after translation by the addition of phosphate, sugar, prenyl, fatty 

acid, and other chemical groups. These modifications are often required for proper protein activity. 
Enzymes involved in post-translational modification include kinases, phosphatases, 
glycosyltransferases, and prenyltransferases. The conformation of proteins may also be modified after 
translation by the introduction and rearrangement of disulfide bonds (rearrangement catalyzed by 

1 o protein disulfide isomerase), the isomerization of proline sidechains by prolyl isomexase, and by 

interactions with molecular chapexone proteins. 

Proteins may also be cleaved by proteases. Such cleavage may result in activation, 
inactivation, or complete degradation of the protein. Proteases include serine proteases, cysteine 
, proteases, aspartic proteases, and metaHoproteases. Signal peptidase in the endoplasmic reticulum 
1 5 (ER) lumen cleaves the signal peptide from membrane or secretory proteins that are imparted into the 
ER. Ubiquitin proteases are associated with the ubiquitin conjugation system (UCS), a major 
pathway for the degradation of cellular proteins in eukaryotic cells and some bacteria. The UCS 
mediates the elimination of abnormal proteins and regulates the half-lives of important regulatory 
proteins that control cellular processes such as gene transcription and cell cycle progression. In the 

2 o UCS pathway, proteins targeted for degradation are conjugated to a ubiquitin, a small heat stable 

protein. Proteins involved in the UCS include ubiquitin-activating enzyme, ubiquitin-conjugating 
enzymes, ubiquitin-ligases, and ubiquitin C-terminal hydrolases. The ubiquitinated protein is then 
recognized and degraded by the proteasome, a large, multisubunit proteolytic enzyme complex, and 
ubiquitin is released for reutilization by ubiquitin protease. 
25 Lipid Metabolism 

Lipids are water-insoluble, oily or greasy substances that are soluble in nonpolar solvents such 
as chloroform or ether. Neutral fats (triacylglycerols) serve as major fuels and energy stores. Polar 
lipids, such as phospholipids, sphingolipids, glycolipids, and cholesterol, are key structural components 
of cell membranes. 

3 o Lipid metabolism is involved in human diseases and disorders. In the arterial disease 

atherosclerosis, fatty lesions form on the inside of the arterial wall. These lesions promote the loss of 
arterial flexibility and the formation of blood-clots (Guyton, A.C. Textbook of Medical Physiology 
(1991) W.B. Saunders Company, Philadelphia PA, pp.760-763). InTay-Sachs disease, the GM 2 
gangiioside (a sphingolipid) accumulates in lysosomes of the central nervous system due to a lack of the 
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enzyme N-acetylhexosaminidase. Patients suffer nervous system degeneration leading to early death 
(Fauci, A.S. et aL (1998) Harrison's Principles of Internal Medicine McGraw-Hill. New York NY, p. 
2171). The Niemann-Pick diseases are caused by defects in lipid metabolism. Niemann-Pick diseases 
types A and B are caused by accumulation of sphingomyelin (a sphingolipid) and other lipids in the 
5 central nervous system due to a defect in the enzyme sphingomyelinase, leading to neurodegeneration 
and lung disease. Niemann-Pick disease type C results from a defect in cholesterol transport, leading to 
the accumulation of sphingomyelin and cholesterol in lysosomes and a secondary reduction in 
sphingomyelinase activity. Neurological symptoms such as grand mal seizures, ataxia, and loss of 

previously learned speech, manifest 1-2 years after birth. A mutation in the NPC protein, which 

* • i 

10 contains a putative cholesterol-sensing domain, was found in a mouse model of Niemann-Pick disease 
type C (Fauci, supra , p. 2175; Loftus, S.K. et al. (1997) Science 277:232-235). (Lipid metabolism is 
reviewed in Stryer, L. .(1995) Biochemistry. W.H. Freeman and Company, New York NY; Lehninger, 
A. (1982) Principles of Biochemistry Worth Publishers, Inc., New York NY; and ExPASy 
"Biochemical Pathways" index of Boehringer Mannheim World Wide Web site.) 

15 Fatty Acid Synthesis 

Fatty acids are long-chain organic acids with a single carboxyl group and a long non-polar 
hydrocarbon tail. Long-chain fatty acids are essential components of glycolipids, phospholipids, and 
cholesterol, which are building blocks for biological membranes, and of triglycerides, which are ' 
biological fuel molecules. Long-chain fatty acids are also substrates for eicosanoid production, and are 

2 o important in the functional modification of certain complex carbohydrates and proteins. 1 6-carbon and 
18-carbon fatty acids are the most common. 

Fatty acid synthesis occurs in the cytoplasm. In the first step, acetyl-Coenzyme A (CoA) 
carboxylase (ACC) synthesizes malonyl-CoA from acetyl-CoA and bicarbonate. The enzymes which 
catalyze the remaining reactions are covalently linked into a single polypeptide chain, referred to as the 

2 5 multifunctional enzyme fatty acid synthase (FAS). FAS catalyzes the synthesis of palmitate from 
acetyl-CoA and malonyl-CoA. FAS contains acetyl transferase, malonyl transferase, p-ketoacetyl 
synthase, acyl carrier protein, JJ-ketoacyl reductase, dehydratase, enoyl reductase, and thioesterase 
activities. The final product of the FAS reaction is the 1 6-carbon fatty acid palmitate. Further 
elongation, as well as unsaturation, of palmitate by accessory enzymes of the ER produces the variety 

30 of long chain fatty acids required by the individual cell. These enzymes include a NADH-cytochrome 
b 5 reductase, cytochrome b 5 , and a desaturase. 
Phospholipid and Triacylglvcerol Synthesis 

Triacylglycerols, also known as triglycerides and neutral fats, are major energy stores in 
animals. Triacylglycerols are esters of glycerol with three fatty acid chains. Glycerol-3-phosphate is 
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produced from dihydroxyacetone phosphate by the enzyme glycerol phosphate dehydrogenase or from 
glycerol by glycerol kinase. Fatty acid-CoA's are produced from fatty adds by fatty acyl-CoA 
synthetases. Glyercbl-3-phosphate is acylated with two fatty acyl-CoA's by the enzyme glycerol 
phosphate acyltransferase to give phosphatidate. Phosphatidate phosphatase converts phosphatidate to 
5 diacylglycerol, which is subsequently acylated to a triacylglyercol by the enzyme digjyceride 
acyltransferase. Phosphatidate phosphatase and diglyceride acyltransferase form a triacylglyerol 
synthetase complex bound to the ER membrane, 

A major class of phospholipids are the phosphoglycerides, which are composed of a glycerol 
backbone, two fatty acid chains, and a phosphorylatQd alcohol. Phosphoglycerides are components of 

10 cell membranes. Principal phosphoglycerides are phosphatidyl choline, phosphatidyl ethanolamine, 
phosphatidyl serine, phosphatidyl inositol, and diphosphatidyl glycerol. Many enzymes involved in 
phosphoglyceride synthesis are associated with membranes (MeyerS, R.A (1995V Molecular Biology 
and Biotechnology . VCH Publishers Inc., New York NY, pp. 494-501). Phosphatidate is converted to 
CDP-diacylglycerol by the enzyme phosphatidate cytidylyltransferase (ExPASy ENZYME EC 

15 2.7.7.41). Transfer of the diacylglycerol group from CDP-diacylglycerol to serine to yield phosphatidyl 
serine, or to inositol to yield phosphatidyl inositol, is catalyzed by the enzymes CDP-diacylglycerol- 
seine O-phosphatidyltransferase and CDP-diacylglycerol-inositol 3-phosphatidyltransferase> 
respectively (ExPASy ENZYME EC 2.7.8.8; ExPASy ENZYME EC 2.7.8.11). The enzyme 
phosphatidyl serine decarboxylase catalyzes the conversion of phosphatidyl serine to phosphatidyl 

20 ethanolamine, using a pyruvate cofactor (Voelker, D.R. (1997) Biochim. Biophys. Acta 1348:236-244). 
Phosphatidyl choline is formed using diet-derived choline by the reaction of CDP-choline with 1 ,2- 
diacylglycerol, catalyzed by diacylglycerol cholinephosphotransferase (ExPASy ENZYME 2.7.8.2). 
Sterol. Steroid, and Isoprenoid Metabolism 

Cholesterol, composed of four fused hydrocarbon rings with an alcohol at one end, moderates 

25 the fluidity of membranes in which it is incorporated. In addition, cholesterol is used in the synthesis of 
steroid hormones such as Cortisol, progesterone, estrogen, and testosterone. Bile salts derived from 
cholesterol facilitate the digestion of lipids. Cholesterol in the skin forms a barrier that prevents excess 
water evaporation from the body. Farnesyl and geranylgeranyl groups, which are derived from 
cholesterol biosynthesis intermediates, are post-translationally added to signal transduction proteins 

3 o such as ras and protein-targeting proteins such as rab. These modifications are important for the 
activities of these proteins (Guyton, supra : Stryer, supra, pp. 279-280, 691-702, 934). 

Mammals obtain cholesterol derived from both denovo biosynthesis and the diet The liver is 
the major site of cholesterol biosynthesis in mammals. Two acetyl-CoA molecules initially condense to 
form acetoacetyl-CoA, catalyzed by a thiolase. Acetoacetyl-CoA condenses with a third acetyl-CoA to 
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form hydroxymethylglutaryl-CoA (HMG-CoA), catalyzed by HMG-CoA syntiiase. Conversion of 

HMG-CoA to cholesterol is accomplished via a series of enzymatic steps known as the mevalonate 

i 

pathway. The rate-limiting step is the conversion of HMG-CoA to mevalonate by HMG-CoA 
reductase. The drug lovastatin, a potent inhibitor of HMG-CoA reductase, is given to patients to 
5 reduce their serum cholesterol levels. Other mevalonate pathway enzymes include mevalonate kinase, 
phosphomevalonate kinase, diphosphomevalonate decarboxylase, isopentenyldiphosphate isomerase, . 
dimethylallyl transferase, geranyl transferase, farnesyl-diphosphate farnesyltransferase, squalene 
monooxygenase, lanosterol synthase, lathosterol oxidase, and 7-dehydr ©cholesterol reductase. 
Cholesterol is used in the synthesis of steroid hormones such as Cortisol, progesterone, 

l o aldosterone, estrogen, and testosterone. First, cholesterol is converted to pregnenolone by cholesterol 
monooxygenases. The other steroid hormones are synthesized from pregnenolone by a sales of 
enzyme-catalyzed reactions including oxidations, isomerizations, hydroxylations, reductions, and 
demethylations. Examples of these enzymes include steroid A-isomerase, 3p-hydroxy-A 5 -steroid 
dehydrogenase, steroid 21 -monooxygenase, steroid 19-hydroxylase, and 3P-hydroxystercrid 

1 1 5 dehydrogenase. Cholesterol is also the precursor to vitamin D. 

Numerous compounds contain 5 -carbon isoprene units derived from the mevalonate pathway 
intermediate isopentenyl pyrophosphate. Isoprenoid groups are found in vitamin K, ubiquinone, retinal, 
dolichol phosphate (a carrier of oligosaccharides needed for N-linked glycosylation), and farnesyl and 
geranylgeranyl groups that modify proteins. Enzymes involved include farnesyl transferase, polyprenyl 

20 transferases, dolichyl phosphatase, and dolichyl kinase. 
Sphingolipid Metabolism 

Sphingolipids are an important class of membrane lipids that contain sphingosine, a long chain 
amino alcohol. They are composed of one long-chain fatty acid, one polar head alcohol, and 
sphingosine or sphingosine derivative. The three classes of sphingolipids are sphingomyelins, 

25 cerebrosides, and gangliosides. Sphingomyelins, which contain phosphocholine or 

phosphoethanolamine as their head group, are abundant in the myelin sheath surrounding nerve cells. 
Galactocerebrosides, which contain a glucose or galactose head group, are characteristic of the brain. 
Other cerebrosides are found in nonneural tissues. Gangliosides, whose head groups contain multiple 
sugar units, are abundant in the brain, but are also found in nonneural tissues. 

3 o Sphingolipids are built on a sphingosine backbone* Sphingosine is acylated to co-amide by the 

enzyme sphingosine acetyltransferase. Ceramide and phosphatidyl choline are converted to 
sphingomyelin by the enzyme ceramide choline phosphotransferase. Cerebrosides are synthesized by 
the linkage of glucose or galactose to ceramide by a transferase. Sequential addition of sugar residues 
to ceramide by transferase enzymes yields gangliosides. 
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Eicosanoid Metabolism 

Eicosanoids, including prostaglandins, prostacyclin, thromboxanes, and leukotrienes, are 20- 
carbon molecules derived from fatty acids, Eicosanoids are signaling molecules which have roles in 
pain, fever, and inflammation. The precursor of all eicosanoids is arachidonate, which is generated 
5 from phospholipids by phospholipase A^ and from diacylglycerqls by diacylglycerol lipase. 

Leukotrienes are produced from arachidonate by the action of lipoxygenases. Prostaglandin synthase, 
reductases, and isomerases are responsible for the synthesis of the prostaglandins. Prostaglandins have 
roles in inflammation, blood flow, ion transport, synaptic transmission, and sleep. Prostacyclin and the 
thromboxanes are derived from a precursor prostaglandin by the action of prostacyclin synthase and 

1 o thromboxane synthases, respectively. 

Ketone Body Metabolism 

Pairs of acetyl-CoA molecules derived from fatty acid oxidation in the liver can condense to 
form acetoacetyl-CoA, which subsequently forms acetoacetate, D-3-hydroxybutyrate, and acetone. 
These three products are known as ketone bodies. Enzymes involved in ketone body metabolism 
15 include HMG-CoA synthetase, HMG-CoA cleavage enzyme, D-3-hydroxybutyrate dehydrogenase, 
acetoacetate decarboxylase, and 3-ketoacyl-CoA transferase. Ketone bodies are a normal fuel supply 
of the heart and renal cortex. Acetoacetate produced by the liver is transported to cells whore the 
acetoacetate is converted back to acetyl-CoA and enters the citric acid cycle. In times of starvation, 
ketone bodies produced from stored triacylglyerols become an important fuel source, especially for the 

2 o brain. Abnormally high levels of ketone bodies are observed in diabetics. Diabetic coma can result if 

ketone body levels become too great 
Lipid Mobilization 

Within cells, fatty acids are transported by cytoplasmic fatty acid binding proteins (Online 
Mendelian Inheritance in Man (OMIM) *1 34650 Fatty Acid-Binding Protein 1, Liver; FABP1). 
25 Diazepam binding inhibitor (DBI), also known as endozepine and acyl CoA-binding protein, is an 

endogenous y-aminobutyric acid (GAB A) receptor ligand which is thought to down-regulate the effects 
of GAB A DBI binds medium- and long-chain acyl-CoA esters with very high affinity and may 
function as an intracellular carrier of acyl-CoA esters (OMIM *125950 Diazepam Binding Inhibitor; 
DBI; PROSITE PDOC00686 Acyl-CoA-binding protein signature). 

3 o Fat stored in liver and adipose triglycerides may be released by hydrolysis and transported in 

the blood. Free fatty acids are transported in the blood by albumin. Triacylgjycerols and cholesterol 
esters in the blood are transported in lipoprotein particles. The particles consist of a core of 
hydrophobic lipids surrounded by a shell of polar lipids and apolipoproteins. The protein components 
serve in the solubilization of hydrophobic lipids and also contain cell-targeting signals. Lipoproteins 
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1 include chylomicrons, chylomicron remnants, very-low-density lipoproteins (VLDL), intermediate- 
density lipoproteins (IDL), low-density lipoproteins (LDL), and high-density lipoproteins (HDL). 
There is a strong inverse correlation between the levels of plasma HDL and risk of premature coronary 
heart disease. 

5 Triacylglycerols in chylomicrons and VLDL are hydrolyzed by lipoprotein lipases that line 

blood vessels in muscle and other tissues that use fatty acids. Cell surface LDL receptors bind LDL 
particles which are then internalized by endocytosis. Absence of the LDL receptor, the cause of the 
disease familial hypercholesterolemia, leads to increased plasma cholesterol levels and ultimately to 
atherosclerosis. Plasma cholesteryl ester transfer protein mediates the transfer of cholesteryl esters 

1 o from HDL to apolipoprotein B-containing lipoproteins. Cholesteryl ester transfer protein is important 

in the reverse cholesterol transport system and may play a role in atherosclerosis (Y amashita, S. et al. 
(1997) Curr. Opin. Lipidol. 8:101-1 10). Macrophage scavenger receptors, which bind and internalize 
modified lipoproteins, play a role in lipid transport and may contribute to atherosclerosis (Greaves, 
D.R. et al. (1998) Curr. Opin. Lipidol. 9:425-432). 
15 . Proteins involved in cholesterol uptake and biosynthesis are tightly regulated in response to 

cellular cholesterol levels. The sterol regulatory element binding protein (SREBP) is a sterol-responsive 
transcription factor. Undo: normal cholesterol conditions, SREBP resides in the ER membrane. When 
1 cholesterol levels are low, a regulated cleavage of SREBP occurs which releases the extracellular 

domain of the protein. This cleaved domain is then transported to the nucleus where it activates the 

2 o transcription of the LDL receptor gene, and genes encoding enzymes of cholesterol synthesis, by 

binding the sterol regulatory element (SRE) upstream of the genes (Y ang, J. et al. (1995) J. Biol. Chem. 
270:12152-12161). Regulation of cholesterol uptake and biosynthesis also occurs via the oxysterol- 
binding protein (OSBP). OSBP is a high-affinity intracellular receptor for a variety of oxysterols that 
down-regulate cholesterol synthesis and stimulate cholesterol esterification (Lagace> T.A. et al. (1997) 
25 Biochem. J. 326:205-213). 
Beta-oxidation 

Mitochondrial and peroxisomal beta-oxidation enzymes degrade saturated and unsaturated fatty 
acids by sequential removal of two-carbon units from CoA-activated fatty acids. The main beta- 
oxidation pathway degrades both saturated and unsaturated fatty acids while the auxiliary pathway 

3 o performs additional steps required for the degradation of unsaturated fatty acids. 

The pathways of mitochondrial and peroxisomal beta-oxidation use similar enzymes, but have 
different substrate specificities and functions. Mitochondria oxidize short-, medium-, and long-chain 
fatty acids to produce energy for cells. Mitochondrial beta-oxidation is a major energy source for 
cardiac and skeletal muscle. In liver, it provides ketone bodies to the peripheral circulation when 

32 

BNSDOCID: <WO 0162927A2_I_> 



WO 01/62927 



PCT/US01/06059 



glucose levels are low as in starvation, endurance exercise, and diabetes (Eaton, S. et al. (1996) 
Biochem. J. 320:345-357). Peroxisomes oxidize medium-, long-, and very-long-chain fatty acids, 
dicarboxylic fatty acids, branched fatty acids, prostaglandins, xenobiotics, and bile acid intermediates. 
The chief roles of peroxisomal beta-oxidation are to shorten toxic lipophilic carboxylic acids to 
5 facilitate their excretion and to shorten very-long-chain fatty acids prior to mitochondrial beta-oxidation 
1 (Mannaerts, G.P. and PP. van Veldhoven (1993) Biochimie 75:147-158). 

Enzymes involved in beta-oxidation include acyl CoA synthetase, carnitine acyltransferase, 
acyl CoA dehydrogenases, enoyl CoA hydratases, L-3-hydroxyacyl CoA dehydrogenase, {3-ketothiolase, 
2,4-dienoyl CoA reductase, and isomerase. 
10 Li pid Cleavage and Degradation 

Triglycerides are hydrolyzed to fatty acids and glycerol by lipases. Lysophospholipases 
(LPLs) are widely distributed enzymes that metabolize intracellular lipids, and occur in numerous 
isoforms. Small isoforms, approximately 15-30 kD, function as hydrolases; large isoforms, those 

i 

exceeding 60 kD, function both as hydrolases and transacylases. A particular substrate for LPLs, 
15 lysophosphatidylcholine, causes lysis of cell membranes when it is formed or imported into a cell. 
LPLs are regulated by lipid factors including acylcarnitine, arachidonic acid, and phosphatidic acid. • 
These lipid factors are signaling molecules important in numerous pathways, including the 
inflammatory response. (Anderson, R. et al. (1994) Toxicol. Appl. Pharmacol. 125:176-183; Sefle, H. 
et al. (1993); Eur. J. Biochem. 212:411-416.) 

2 o The secretory phospholipase A2 (PL A2) superf amily comprises a number of heterogeneous 

enzymes whose common feature is to hydrolyze the sn-2 fatty acid acyl ester bond of 
phosphoglycerides. Hydrolysis of the glycerophospholipids releases free fatty acids and 
lysophospholipids. PLA2 activity generates precursors for the biosynthesis of biologically active lipids, 
hydroxy fatty acids, and platelet-activating factor. PLA2 hydrolysis of the sn-2 ester bond in 
25 phospholipids generates free fatty acids, such as arachidonic acid and lysophospholipids. 
Carbon and Carbohydrate Metabolism 

Carbohydrates, including sugars or saccharides, starch, and cellulose, are aldehyde or ketone 
compounds with multiple hydroxyl groups. The importance of carbohydrate metabolism is 
demonstrated by the sensitive regulatory system in place for maintenance of blood glucose levels. Two 

3 0 pancreatic hormones, insulin and glucagon, promote increased glucose uptake and storage by ceils, and 

increased glucose release from cells, respectively. Carbohydrates have three important roles in 
mammalian cells. First, carbohydrates are used as energy stores, fuels, and metabolic intermediates. 
Carbohydrates are broken down to form energy in glycolysis and are stored as glycogen for late- use. 
Second, the sugars deoxyribose and ribose form part of the structural support of DNA and RNA, 
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respectively. Third, carbohydrate modifications are added to secreted and membrane proteins and lipids 
as they traverse the secretory pathway. Cefl surface carbohydrate-containing macromolecules, 
including glycoproteins, glycolipids, and transmembrane proteoglycans, mediate adhesion with other 
cells and with components of the extracellular matrix. The extracellular matrix is comprised of diverse 
5 glycoproteins, glycosaminoglycans (GAGs), and carbohydrate-binding proteins which are secreted from 
the cell and assembled into an organized meshwork in close association with the cell surface. The 
interaction of the cell with the surrounding matrix profoundly influences cell shape, strength, flexibility, 
motility, and adhesion. These dynamic properties are intimately associated with signal transduction 
pathways controlling cell proliferation and differentiation, tissue construction, and embryonic 
10 development 

Carbohydrate metabolism is altered in several disorders including diabetes mellitus, 
hyperglycemia, hypoglycemia, galactosemia, galactokinase deficiency, and UDP-galactose-4-epimerase 
deficiency (Fauci, A.S. et al. (1998) Harrison's Principles of Internal Medicine* McGraw-Hill, New 
York NY, pp. 2208-2209). Altered carbohydrate metabolism is associated with cancer. Reduced GAG 

is and proteoglycan expression is associated with human lung carcinomas (Nackaerts, K. et al. (1997) Int. 
J. Cancer 74:335-345). The carbohydrate determinants sialyl Lewis A and sialyl Lewis X are 
frequently expressed on human cancer cells (Kannagi, R. (1997) Glycoconj. J. 14:577-584). . 
Alterations of the N-linked carbohydrate core structure of cell surface glycoproteins are linked to colon 
and pancreatic cancers (Schwarz, R£. et al. (1996) Cancer Lett. 107:285-291). Reduced expression 

20 of the Sda blood group carbohydrate structure in cell surface glycolipids and glycoproteins is observed 
in gastrointestinal cancer (Dohi, T. et al. (1996) Int. J. Cancer 67:626-663). (Carbon and 
carbohydrate metabolism is reviewed in Stryer, L. (1995) Biochemistry W.H. Freeman and Company, 
New York NY; Lehninger, A.L. (1982) Principles of Biochemistry Worth Publishers Inc., New York 
NY; and Lodish, H. et al. (1995) Molecular Cell Biology Scientific American Books, New York NY.) 

25 Glycolysis 

Enzymes of the glycolytic pathway convert the sugar glucose to pyruvate while simultaneously 
producing ATP. The pathway also provides building blocks for the synthesis of cellular components 
such as long-chain fatty acids. After glycolysis, pyrvuate is converted to acetyl-Coenzyme A, which, in 
aerobic organisms, enters the citric acid cycle. Glycolytic enzymes include hexokinase, phosphoglucose 
3 o isomerase, phosphofructokinase, aldolase, triose phosphate isomerase, glyceraldehyde 3-phosphale 
dehydrogenase, phosphoglycerate kinase, phosphoglyceromutase, enolase, and pyruvate kinase* Of 
these, phosphofructokinase, hexokinase, and pyruvate kinase are important in regulating the rate of 
glycolysis. 
Gluconeogenesis 
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Gluconeogenesis is the synthesis of glucose from noncarbohydrate precursors such as lactate 
and amino acids. The pathway, which functions mainly in times of starvation and intense exercise, 
occurs mostly in the liver and kidney. Responsible enzymes include pyruvate carboxylase, 
phosphoenolpynivate carboxykinase, fructose 1,6-bisphosphatase, and glucose-6-phosphatase. 
5 Pentose Phosphate Pathway ' ■ 

Pentose phosphate pathway enzymes are responsible for 'generating the reducing agent 
NADPH t while at the same time oxidizing glucose-6-phosphate to ribose-5-phosphate. Ribose-5- 
phosphate and its derivatives become part of important biological molecules such as ATP, Coenzyme 
A, NAD\ FAD, RNA, and DNA The pentose phosphate pathway has both oxidative and non- 

10 oxidative branches. The oxidative branch steps, which are catalyzed by the enzymes glucose-6- 
phosphate dehydrogenase, lactonase, and 6-phosphogluconate dehydrogenase, convert glucose-6- 
phosphate and NADP+ to ribulose-6-phosphate and NADPH. The non-oxidative branch steps, which 
are catalyzed by the enzymes phosphopentose isomer ase, phosphopentose epimerase, transketolase, and 
trans aldolase, allow the interconversion of three-, four-, five-, six-, and seven-carbon sugars. 

15 Glucouronate Metabolism 

Glucuronate is a monosaccharide which, in the form of D-glucuronic acid, is found in the 
GAGs chondroitin and dermatan. D-glucuronic acid is also important in the detoxification and 
excretion of foreign organic compounds such as phenol. Enzymes involved in glucuronate metabolism 
.include UDP-glucose dehydrogenase and glucuronate reductase. 

20 Disaccharide Metabolism 

Disaccharides must be hydrolyzed to monosaccharides to be digested. Lactose, a disaccharide 
found in milk, is hydrolyzed to galactose and glucose by the enzyme lactase. Maltose is derived from 
plant starch and is hydrolyzed to glucose by the enzyme maltase. Sucrose is derived from plants and is 
hydrolyzed to glucose and fructose by the enzyme sucrase. Trehalose, a disaccharide found mainly in 

2 5 insects and mushrooms, is hydrolyzed to glucose by the enzyme trehalase {OMIM *2753t>0 Trehalase; 
Ruf, J. et al. (1990) J. Biol. Cheia 265:15034-15039). Lactase, maltase, sucrase, and trehalase are 
bound to mucosal cells lining the small intestine, where they participate in the digestion of dietary 
disaccharides. The enzyme lactose synthetase, composed of the catalytic subunit galactosyltransferase 
and the modifier subunit a-lactalbumin, converts UDP-galactose and glucose to lactose in the mammary 

30 glands. 

Glycogen. Starch, and Chitin Metabolism 

Glycogen is the storage form of carbohydrates in mammals. Mobilization of glycogen 
maintains glucose levels between meals and during muscular activity. Glycogen is stored mainly in the 
liver and in skeletal muscle in the form of cytoplasmic granules. These granules contain enzymes that 
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1 catalyze the synthesis and degradation of glycogen, as wen as enzymes that regulate these processes. 
Enzymes that catalyze the degradation of glycogen include glycogen phosphorylase, a transferase, a- 
1 ,6-glucosidase, and phosphoghicorautase. Enzymes that catalyze the synthesis of glycogen include 
UDP-glucose pyrophosphatase, glycogen synthetase, a branching enzyme, and nucleoside 
5 diphosphokinase. The enzymes of glycogen synthesis and degradation are tightly regulated by the 
hormones insulin, glucagon, and epinephrine. Starch, a plant-derived polysaccharide, is hydrolyzed to 
maltose, maltotriose, and a-dextrin by a-amylase, an enzyme secreted by the salivary glands and 
pancreas. Chitin is a polysaccharide found in insects and Crustacea. A chitotriosidase is secreted by 
macrophages and may play a role in the degradation of chitin-containing pathogens (Boot, R.G. et al. 

10 (1995) J. BioLChem. 270:26252-26256). 
Peptidoglvcans and Glvcosaminoglvcans 

Glycosaminoglycans (GAGs) are anionic linear unbranched polysaccharides composed of 
repetitive disaccharide units. These repetitive units contain a derivative of an amino sugar, dither 
glucosamine or galactosamine. GAGs exist free or as part of proteoglycans, large molecules composed 

15 of a core protein attached to one or more GAGs. GAGs are found on the ceil surface, inside cells, and 
in the extracellular matrix. Changes in GAG levels are associated with several autoimmune diseases 
including autoimmune thyroid disease, autoimmune diabetes mellitus, arid systemic lupus 
1 erythematosus (Hansen, C. et al. (1996) Clin. Exp. Rheum. 14 (Suppl. 15):S59-S67). GAGs include 

chondroitin sulfate, keratan sulfate, heparin, heparan sulfate, dermatan sulfate, and hyaluronan. 

20 The GAG hyaluronan (HA) is found in the extracellular matrix of many cells, especially in soft 

connective tissues, and is abundant in synovial fluid (Pitsillides, A A et aL (1993) InL J. Exp. PathoL 
74:27-34). HA seems to play important roles in cell regulation, development, and differentiation 
(Laurent, T.C. and J.R. Fraser (1992) FASEB J. 6:2397-2404). Hyaluronidase is an enzyme that 
degrades HA to oligosaccharides. Hyaluronidases may function in cell adhesion, infection, 

2 5 angiogenesis, signal transduction, reproduction, cancer, and inflammation. 

Proteoglycans, also known as peptidoglvcans, are found in the extracellular matrix of 
connective tissues such as cartilage and are essential for distributing the load in weight-bearing joints. 
Cell-surface-attached proteoglycans anchor cells to the extracellular matrix. Both extracellular and 
cell-surface proteoglycans bind growth factors, facilitating their binding to cell-surface receptors and 

3 o subsequent triggering of signal transduction pathways. 

Amino Acid and Nitrogen Metabolism 

NH 4 + is assimilated into amino acids by the actions of two enzymes, glutamate 
dehydrogenase and glutamine synthetase. The carbon skeletons of amino acids come from the 
intermediates of glycolysis, the pentose phosphate pathway, or the citric acid cycle. Of the twenty 
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amino acids used in proteins, humans can synthesize only thirteen (nonessential amino acids). The 
remaining nine must come from the diet (essential amino acids). Enzymes involved in nonessential 
amino acid biosynthesis include glutamate kinase dehydrogenase, pyrroline carboxylate reductase, 
asparagine synthetase, phenylalanine oxygenase, methionine adenosyltransferase, 
5 adenosylhomocysteinase, cystathionine P-synthase, cystathionine y-lyase, phosphoglycerate 
dehydrogenase, phosphoserine transaminase, phosphoserine phosphatase, serine 
hydroxylmethyltransferase, and glycine synthase. 

Metabolism of amino acids takes place almost entirely in the liver, where the amino group is 
removed by aminotransferases (transaminases), for example, alanine aminotransferase. The amino 

1 o group is transferred to a-ketoglutarate to form glutamate. Glutamate dehydrogenase converts 

glutamate to NH 4 + and a-ketoglutarate. NH 4 + is converted to urea by the urea cycle which is 
catalyzed by the enzymes arginase, ornithine transcarbamoylase, arginosuccinate synthetase, and 
arginosuccinase. Carbamoyl phosphate synthetase is also involved in urea formation. Enzymes 
involved in the metabolism of the carbon skeleton of amino acids include serine dehydratase, 

is asparaginase, glutaminase, propionyl CoA carboxylase, methylmalonyl CoA mutase, branched-chain 
a-keto dehydrogenase complex, isovaleryl CoA dehydrogenase, P-methylcrotonyl CoA carboxylase, 
phenylalanine hydroxylase, p-hydroxylphenylpyruvate hydroxylase, and homogentisate oxidase. 

Poly amines, which include spermidine, putrescine, and spermine, bind tightly to nucleic acids 
and are abundant in rapidly proliferating cells. Enzymes involved in polyamine synthesis include 

20 ornithine decarboxylase. 

Diseases involved in amino acid and nitrogen metabolism include hyperammonemia, 
carbamoyl phosphate synthetase deficiency, urea cycle enzyme deficiencies, methylmalonic aciduria, 
maple syrup disease, alcaptonuria, and phenylketonuria. 
Energy Metabolism 

2 5 Cells derive energy from metabolism of ingested compounds that may be roughly categorized 

as carbohydrates, fats, or proteins. Energy is also stored in polymers such as triglycerides Xfats) and 
glycogen (carbohydrates). Metabolism proceeds along separate reaction pathways connected by key 
intermediates such as acetyl coenzyme A (acetyl-CoA). Metabolic pathways feature anaerobic and 
aerobic degradation, coupled with the energy-requiring reactions such as phosphorylation of 

3 o adenosine diphosphate (ADP) to the triphosphate (ATP) or analogous phosphorylations of guanosine 

(GDP/GTP), uridine (UDPAJTP), or cytidine (CDP/CTP). Subsequent dephosphorylation of the 
triphosphate drives reactions needed for cell maintenance, growth, and proliferation. 

Digestive enzymes convert carbohydrates and sugars to glucose; fructose and galactose are 
converted in the liver to glucose. Enzymes involved in these conversions include galactose-1- 
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phosphate uridyl transferase and UDP-galactose-4 epimerase. In the cytoplasm, glycolysis converts 
glucose to pyruvate in a series of reactions coupled to ATP synthesis. 

Pyruvate is transported into the mitochondria and converted to acetyl-CoA for oxidation via 
the citric add cycle, involving pyruvate dehydrogenase components, dihydrolipoyl transacetylase, and 
5 dihydrolipoyl dehydrogenase. Enzymes involved in the citric acid cycle include: citrate synthetase, 
aconitases, isocitrate dehydrogenase, alpha-ketoglutarate dehydrogenase complex including 
transsuccinylases, succinyl CoA synthetase, succinate dehydrogenase, fumarases, and malate 
dehydrogenase. Acetyl CoA is oxidized to C0 2 with concomitant formation of NADH, FADH 2 , and 
GTP. In oxidative phosphorylation, the transport of electrons from NADH and FADH 2 to oxygen by 

10 dehydrogenases is coupled to the synthesis of ATP from ADP and Pi by the F^i ATPase complex in 
the mitochondrial inner membrane. Enzyme complexes responsible for electron transport and ATP 
synthesis include the F^ ATPase complex, ubiqumone(CoQ)-cytochrome c reductase, ubiquinone 
reductase, cytochrome b, cytochrome FeS protein, and cytochrome c oxidase. , 

Triglycerides are hydrolyzed to fatty acids and glycerol by lipases. Glycerol is then 

15 phosphorylated to glycerol-3-phosphate by glycerol kinase and glycerol phosphate dehydrogenase, 
and degraded by the glycolysis. Fatty acids are transported into the mitochondria as fatty acyl- 
carnitine esters and undergo oxidative degradation. 

In addition to metabolic disorders such as diabetes and obesity, disorders of energy 
metabolism are associated with cancers (Dorward, A. et al. (1997) J. Bioenerg. Biomembr. 29:385- 

20 392), autism (Lombard, J. (1998) Med. Hypotheses 50:497-500), neurodegenerative disorders (Alexi, 
T. et al. (1998) Neuroreport 9:R57-64), and neuromuscular disorders (DiMauro, S. et at. (1998) 
Biochim. Biophys. Acta 1366: 199-210). The myocardium is heavily dependent on oxidative 
metabolism, so metabolic dysfunction often leads to heart disease (DiMauro, S. and M. Hirano (1998) 
Curr. Opin. Cardiol. 13:190-197). 

25 For a review of energy metabolism enzymes and intermediates, see Stryer, L. et al. (1995) 

Biochemistry. W.H. Freeman and Co., San Francisco CA, pp. 443-652. For a review of energy 
metabolism regulation, see Lodish, H. et al. (1995) Molecular Cell Biology . Scientific American 
Books, New York NY, pp. 744-770. 
Cof actor Metabolism 

30 Cof actors, including coenzymes and prosthetic groups, are small molecular weight inorganic 
or organic compounds that are required for the action of an enzyme. Many cofactors contain vitamins 
as a component. Cofactors include thiamine pyrophosphate, flavin adenine dinucleotide, flavin 
mononucleotide, nicotinamide adenine dinucleotide, pyridoxal phosphate, coenzyme A, 
tetrahydrofolate, lipoamide, and heme. Hie vitamins biotin and cobalamin are associated with 
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enzymes as well. Heme, a prosthetic group found in myoglobin and hemoglobin, consists of 
protoporphyrin group bound to iron. Porphyrin groups contain four substituted pyrroles covalentiy 
joined in a ring, often with a bound metal atom. Enzymes involved in porphyrin synthesis include 5- 
aminolevulinate synthase, 6-aminolevulinate dehydrase, porphobilinogen deaminase, and cosynthase. 
5 Deficiencies in heme formation cause porphyrias. Heme is broken down as a part of erythrocyte ' 
' turnover. Enzymes involved in heme degradation include heme oxygenase and biliverdin reductase. 

Iron is a required cofactor for many enzymes. Besides the heme-containing enzymes, iron is 
found in iron-sulfur clusters in proteins including aconitase, succinate dehydrogenase, and NADH-Q 
reductase. Iron is transported in the blood by the protein transferrin. Binding of transferrin to the 
10 transferrin receptor on cell surfaces allows uptake by receptor mediated endocytosis. Cytosolic iron is 
bound to ferritin protein. 

A molybdenum-containing cofactor (molybdopterin) is found in enzymes including sulfite 
oxidase, xanthine dehydrogenase, and aldehyde oxidase. Molybdopterin biosynthesis is performed by 
two molybdenum cofactor synthesizing enzymes. Deficiencies in these enzymes cause mental 
15 retardation and lens dislocation. Other diseases caused by defects in cofactor metabolism include 
pernicious anemia and methylmalonic aciduria. 
Secretion and Trafficking 

Eukaryotic ceils are bound by a lipid bilayer membrane and subdivided into functionally 
distinct, membrane bound compartments. The membranes maintain the essential differences between 
20 the cytosol, the extracellular environment, and the lumenal space of each intracellular organelle. As 
lipid membranes are highly impermeable to most polar molecules, transport of essential nutrients, 
metabolic waste products, cell signaling molecules, macromolecules and proteins across lipid 
membranes and between organelles must be mediated by a variety of transport-associated molecules. 
Protein Trafficking 

25 In eukaryotes, some proteins are synthesized on ER-bound ribosomes, co-translationally 

imported into the ER, delivered from the ER to the Golgi complex for post-translational processing and 
sorting, and transported from the Golgi to specific intracellular and extracellular destinations. All cells 
possess a constitutive transport process which maintains homeostasis between the cell and its 
environment In many differentiated cell types, the basic machinery is modified to carry out specific 

3 o transport functions. For example, in endocrine glands, hormones and other secreted proteins are 

packaged into secretory granules for regulated exocytosis to the cell exterior. In macrophage, foreign 
extracellular material is engulfed (phagocytosis) and delivered to lysosomes for degradation. In fat and 
muscle cells, glucose transporters are stored in vesicles which fuse with the plasma membrane only in 
response to insulin stimulation. 
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The Secretory Pathway 

Synthesis of most integral membrane proteins, secreted proteins, and proteins destined for the 

lumen of a particular organdie occurs on ER-bound ribosomes. These proteins are co-translationally 

i t ' 

imported into the ER. The proteins leave the ER via membrane-bound vesicles which bud off the ER at 
5 specific sites and fuse with each other (homotypic fusion) to form the ER-Golgi Intermediate 

Compartment (ERGIC). The ERGIC matures progressively through the cis, medial, and trans cisternal 
stacks of the Golgi, modifying the enzyme composition by retrograde transport of specific Gdlgi 
enzymes. In this way, proteins moving through the Golgi undergo post-translational modification, such 
as glycosylation. The final Golgi compartment is the Trans-Golgi Network (TGN), where both 

1 o membrane and lumenal proteins are sorted for their final destinatioa Transport vesicles destined for 

intracellular compartments, such as the lysosome, bud off the TGN. What remains is a secretory 
vesicle which contains proteins destined for the plasma membrane, such as receptors, adhesion 
molecules, and ion channels, and secretory proteins, such as hormones, neurotransmitters, and digestive 
enzymes. Secretory vesicles eventually fuse with the plasma membrane (Glick, B.S. and V. Malhotra 

15 (1998) Cell 95:883-889). 

The secretory process can be constitutive or regulated. Most cells have a constitutive pathway 
for secretion, whereby vesicles derived from maturation of the TGN require no specific signal to fuse 
with the plasma membrane. In many cells, such as endocrine cells, digestive cells, and neurons, vesicle 
pools derived from the TGN collect in the cytoplasm and do not fuse with the plasma membrane until 

20 they are directed to by a specific signal. 
Endocvtosis 

Endocytosis, wherein cells internalize material from the extracellular environment, is essential 
for transmission of neuronal, metabolic, and proliferative signals; uptake of many essential nutrients; 
and defense against invading organisms. Most cells exhibit two forms of endocytosis. The first, 

2 5 phagocytosis, is an actin-driven process exemplified in macrophage and neutrophils. Material to be 

endocytosed contacts numerous cell surface receptors which stimulate the plasma membrane to extend 
and surround the particle, enclosing it in a membrane-bound phagosome. In the mammalian immune 
system, IgG-coated particles bind Fc receptors on the surface of phagocytic leukocytes. Activation of 
the Fc receptors initiates a signal cascade involving src-family cytosolic kinases and the monomelic 

3 o GTP-binding (G) protein Rho. The resulting actin reorganization leads to phagocytosis of the particle. 

This process is an important component of the humoral immune response, allowing the processing and 
presentation of bacterial-derived peptides to antigen-specific T-lymphocytes. 

The second form of endocytosis, pinocytosis, is a more generalized uptake of material from the 
external milieu. Like phagocytosis, pinocytosis is activated by ligand binding to cell surface receptors. 
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Activation of individual receptors stimulates an internal response that includes coalescence of the 

receptor-ligand complexes and formation of clathrin-coated pits. Invagination of the plasma membrane 

at clathrin-coated pits produces an endocytic vesicle within the cell cytoplasm. , These vesicles undergo 

homotypic fusion to form an early endosomal (EE) compartment The tubulovesicular EE serves as a 

5 sorting site for incoming material. ATP-driven proton pumps in the EE membrane lowers the pH of the 

EE lumen (pH 6.3-6.8). The acidic environment causes many ligands to dissociate from their receptors. 

The receptors, along with membrane and other integral membrane proteins, are recycled back to the 

plasma membrane by budding off the tubular extensions of the EE in recycling vesicles (RV). This 

* . i 

selective removal of recycled components produces a, carrier vesicle containing ligand and other 

1 o material from the external environment. The carrier vesicle fuses with TGN-derived vesicles which 

contain hydrolytic enzymes. The acidic environment of the resulting late endosome (LE) activates the 
hydrolytic enzymes which degrade the ligands and other material. As digestion takes place, the LE 
fuses with the lysosome where digestion is completed (Mellman, I. (1996) Anna. Rev. Cell Dev. Biol. 
12:575-625). 

is Recycling vesicles may return directly to the plasma membrane. Receptors internalized and 

returned directly to the plasma membrane have a turnover rate of 2-3 minutes. Some RVs undergo 
microtubule-directed relocation to a perinuclear site, from which they then return to the plasma 
membrane. Receptors following this route have a turnover rate of 5-10 minutes. Still other RVs are 
retained within the cell until an appropriate signal is received (Mellman, supra ; and James, D.E. et al. 

20 (1994) Trends Cell Biol. 4:120-126). 
Vesicle Formation 

Several steps in the transit of material along the secretory and endocytic pathways require the 
formation of transport vesicles. Specifically, vesicles form at the transitional endoplasmic reticulum 
(tER), the rim of Golgi cisternae, the face of the Trans-Golgi Network (TGN), the plasma membrane 

2 5 (PM), and tubular extensions of the endosomes. The process begins with the budding of a vesicle out of 

the donor membrane. The membrane-bound vesicle contains proteins to be transported and is 
surrounded by a protective coat made up of protein subunits recruited from the cytosol. The initial 
budding and coating processes are controlled by a cytosolic ras-like GTP-binding protein, ADP- 
ribosylating factor (Arf), and adapter proteins (AP). Different isoforms of both Arf and AP are 

3 o involved at different sites of budding. Another small G-protein, dynamin, forms a ring complex around 

the neck of the forming vesicle and may provide the mechanochemical force to accomplish the final step 
of the budding process. The coated vesicle complex is then transported through the cytosol. During the 
transport process, Arf-bound GTP is hydrolyzed to GDP and the coat dissociates from the transport 
vesicle (West, M.A. et al. (1997) J. Cell Biol. 138:1239-1254). Two different classes of coat protein 
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have also been identified Clathrin coats form on the TGN and PM surfaces, whereas coatomer or COP 
coats form on the ER and Golgi. COP coats can further be distinguished as COPI, involved in 
retrograde traffic through the Golgi and from the Golgi to the ER, and COPII, involved in anterograde 
traffic from the ER to the Golgi (Mellman, supra) . The COP coat consists of two major components, a 
5 G-protein (Arf or Sar) and coat protomer (coatomer). Coatomer is an equimolar complex of seven 
proteins, tamed alpha-, beta-, beta'-, gamma-, delta-, epsilon- and zeta-COP. (Harter, C. andF.T. 
Wieland (1998) Proc. Natl. Acad Sci. USA 95:1 1649-1 1654.) 
Membrane Fusion 

Transport vesicles undergo homotypic or heterotypic fusion in the secretory and endocytotic 
10 pathways. Molecules required for appropriate targeting and fusion of vesicles with their target 

membrane include proteins incorporated in the vesicle membrane, the target membrane, and proteins 
recruited from the cytosol. During budding of the vesicle from the donor compartment, an integral 
membrane protein, VAMP (vesicle-associated membrane protein) is incorporated into the vesicle. Soon 
after the vesicle uncoats, a cytosolic prenylated GTP-binding protein, Rab (a member of the Ras 
15 superfamily), is inserted into the vesicle membrane. GTP-bound Rab proteins are directed into nascent 
transport vesicles whore they interact with VAMP. Following vesicle transport, GTPase activating 
proteins (GAPs) in the target membrane convert Rab proteins to the GDP-bound form. A cytosolic 
protein, guanine-nucleotide dissociation inhibitor (GDI) helps return GDP-bound Rab proteins to their 
membrane of origin. Several Rab isoforms have been identified and appear to associate with specific 
2 o compartments within the cell. Rab proteins appear to play a role in mediating the function of a viral 
gene, Rev, which is essential for replication of HTV-1, the virus responsible for AIDS (FlaveH, R.A. et 
al. (1996) Proc. Natl. Acad. Sci. USA 93:4421-4424). 

Docking of the transport vesicle with the target membrane involves the formation of a complex 
between the vesicle SNAP receptor (v-SNARE), target membrane (t-) SNAREs, and certain other 

2 5 membrane and cytosolic proteins. Many of these other proteins have been identified although their 

exact functions in the docking complex remain uncertain (Tellam, J.T. et al. (1995) J. Biol. Chem. 
270:5857-5863; and Hata, Y. and T.C. Sudhof (1995) J. Biol. Chem. 270:13022-13028). 
N-ethylmaleimide sensitive factor (NSF) and soluble NSF-attachment protein (a-SNAP and P-SNAP) 
are two such proteins that are conserved from yeast to man and function in most intracellular membrane 

3 o fusion reactions. Seel represents a family of yeast proteins that function at many different stages in the 

secretory pathway including membrane fusion. Recently, mammalian homologs of Seel , called 
Munc-18 proteins, have been identified (Katagiri, H. et al. (1995) J. Biol. Chem. 270:4963-4966; Hata 
etal. supra) . 

The SNARE complex involves three SNARE molecules, one in the vesicular membrane and 
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two in the target membrane, Synaptotagmin is an integral membrane protein in the synaptic vesicle 
which, associates with the t-SNARE syntaxiii in the docking complex. Synaptotagmin binds calcium in 
a complex with negatively charged phospholipids, which allows the cytosolic SNAP protein to displace 
synaptotagmin from syntaxin and fusion to occur. Thus, synaptotagmin is a negative regulator of 
5 fusion in the neuron (Littleton, J.T. et al. (1993) Cell 74:1 125-1 134). The most abundant membrane 
protein of synaptic vesicles appears to be the glycoprotein synaptophysin, a 38 WDa protein with four 
transmembrane domains. 

Specificity between a vesicle and its target is derived from the v-SNARE, t-SNAREs, and 
associated proteins involved. Different isoforms of SNAREs and Rabs show distinct cellular and 
10 subcellular distributions. VAMP-l/synaptobrevin, membrane- anchored synaptosome-associated 

protein of 25 kDa (SNAP-25), syntaxin-1, Rab3A, Rabl5, and Rab23 are predominantly expressed in 
the brain and nervous system. Different syntaxin, VAMP, and Rab proteins are associated with 
distinct subcellular compartments and their vesicular carriers. 
Nuclear Transport 

1 5 Transport of proteins and RN A between the nucleus and the cytoplasm occurs through nuclear 

pore complexes (NPCs). NPC -mediated transport occurs in both directions through the nuclear 
envelope. All nuclear proteins are imported from the cytoplasm, their site of synthesis; tRNA and 
mRNA are exported from the nucleus, their site of synthesis, to the cytoplasm, their site of function. 
Processing of small nuclear RNAs involves export into the cytoplasm, assembly with proteins and 

2 o modifications such as hypermethylation to produce small nuclear ribonuclear proteins (snRNPs), and 

subsequent import of the snRNPs back into the nucleus. The assembly of ribosomes requires the initial 
import of ribosomal proteins from the cytoplasm, their incorporation with RNA into ribosomal 
subunits, and export back to the cytoplasm. (GSrlich, D. andl.W. Mattaj (1996) Science 271:15 13- 
1518.) 

25 The transport of proteins and mRNAs across the NPC is selective, dependent on nuclear 

localization signals, and generally requires association with nuclear transport factors. Nuclear 
localization signals (NLS) consist of short stretches of amino acids enriched in basic residues. NLS are 
found on proteins that are targeted to the nucleus, such as the glucocorticoid receptor. The NLS is 
recognized by the NLS receptor, importin, which then interacts with the monomelic GTP-binding 

3 o protein Ran. This NLS protein/receptor/Ran complex navigates the nuclear pore with the help of the 

homodimeric protein nuclear transport factor 2 (NTF2). NTF2 binds the GDP-bound form of Ran and 
to multiple proteins of the nuclear pore complex containing FXFG repeat motifs, such as p62. 
(Paschal, B. et al. (1997) J. Biol. Chem. 272:21534-21539; and Wong, D.H. et al. (1997) Mol. Cell 
Biol. 17:3755-3767). Some proteins are dissociated before nuclear mRNAs are transported across the 
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NPC whole others are dissociated shortly after nuclear mRNA transport across the NPC and are 
reimported into the nucleus. 
Disease Correlation 

The etiology of numerous human diseases and disorders can be attributed to defects in the 
5 transport or secretion of proteins. For example, abnormal hormonal secretion is linked to disorders 
such as diabetes insipidus (vasopressin), hyper- and hypoglycemia (insulin, glucagon), Grave's disease 
and goiter (thyroid hormone), and Cushing's and Addison's diseases (adrenocorticotropic hormone* 
ACTH). Moreover, cancer cells secrete excessive amounts of hormones or other biologically active 

peptides. Disorders related to excessive secretion of biologically active peptides by tumor cells include 

i 

10 fasting hypoglycemia due to increased insulin secretipn from insulinoma-islet cell tumors; hypotension 
due to increased epinephrine and norepinephrine secreted from pheochromocytomas of the adrenal 
medulla and sympathetic paraganglia; and carcinoid syndrome, which is characterized by abdominal 
cramps, diarrhea, and valvular heart disease caused by excessive amounts of vasoactive substances 
such as serotonin, bradykinin, histamine, prostaglandins, and polypeptide hormones, secreted from 

1 5 intestinal tumors. Biologically active peptides that are ectopically synthesized in and secreted from 
tumor cells include ACTH and vasopressin (lung and pancreatic cancers); parathyroid hormone (lung 
and bladder cancers); calcitonin (lung and breast cancers); and thyroid-stimulating hormone (medullary 
thyroid carcinoma). Such peptides may be useful as diagnostic markers for tumorigenesis (Schwartz, 
M.Z. (1997) Semia Pediatr. Surg. 3:141-146; and Said, S.I. and G.R. Faloona (1975) N. Engl. J. Med. 

20 293:155-160). 

Defective nuclear transport may play a role in cancer. The BRCA1 protean contains three 
potential NLSs which interact with importin alpha, and is transported into the nucleus by the 
importin/NPC pathway. In breast cancer cells the BRCA1 protein is aberrantly localized in the 
cytoplasm. The mislocation of the BRCA1 protein in breast cancer ceils may be due to a defect in the 
25 NPC nuclear import pathway (Chen, C.F. et al. (1996) J. Biol. Chem. 271 :32863-32868). 

It has been suggested that in some breast cancers, the tumor-suppressing activity of p53 is 
inactivated by the sequestration of the protein in the cytoplasm, away from its site of action in the cell 
nucleus. Cytoplasmic wild-type p53 was also found in human cervical carcinoma cell lines. (Moll, 
U.M. et al. (1992) Proc. Natl. Acad. Sci. USA 89:7262-7266; and Liang, X.H. et al. (1993) Oncogene 
30 8:2645-2652.) 

Environmental Responses 

Organisms respond to the environment by a number of pathways. Heat shock proteins, 
including hsp 70, hsp60, hsp90, and hsp 40, assist organisms in coping with heat damage to cellular 
proteins. 
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Aquaporins (AQP) are channels that transport water and, in some cases, nonionic small solutes 
such as urea and glycerol. Water movement is important for a number of physiological processes 
including renal fluid filtration, aqueous humor generation in the eye, cerebrospinal fluid production in 
the brain, and appropriate hydration of the lung. Aquaporins are members of the major intrinsic protein 
5 (MIP) family of membrane transporters (King, L.S. and P. Agre (1996) Annu. Rev. Physiol. 58:619- 
648; Ishibashi, K. et al. (1997) J. Biol. Chem. 272:20782-20786). The study of aquaporins may have 
relevance to understanding edema formation and fluid balance in both normal physiology and disease 
states (King, supra) . Mutations in AQP2 cause autosomal recessive nephrogenic diabetes insipidus 
(OMIM *107777 Aquaporin 2; AQP2). Reduced AQP4 expression in skeletal muscle may be 
10 associated with Duchenne muscular dystrophy (Frigeri, A et al. (1998) J. Clin. Invest. 102:695-703). 
Mutations in AQP0 cruise autosomal dominant cataracts in the mouse (OMIM *154050 Major Intrinsic 
Protein of Lens Fiber; MIP). 

The metallothioneins (MTs) are a group of small (61 amino acids), cysteinerrich proteins that 
bind heavy metals such as cadmium, zinc, mercury, lead, and copper and are thought to play a role in 
15 metal detoxification or the metabolism and homeostasis of metals. Arsenite-resistance proteins have 
been identified in hamsters that are resistant to toxic levels of arsenite (Rossman, T.G. et al. (1997) 
Mutat Res. 386:307-314). 

Humans rfespond to light and odors by specific protein pathways. Proteins involved in light 
perception include rhodopsin, transducin, and cGMP phosphodiesterase. Proteins involved in odor 

2 o perception include multiple olfactory receptors. Other proteins are important inhuman Orcadian 

rhythms and responses to wounds. 
Immunity and Host Defense 

All vertebrates have developed sophisticated and complex immune systems that provide 
protection from viral, bacterial, fungal and parasitic infections. Included in these systems are the 
25 processes of humoral immunity, the complement cascade and the inflammatory response (Paul, WJS. 
(1993) Fundamental Immunology . Raven Press, Ltd., New York NY, pp.1-20). 

The cellular components of the humoral immune system include six different types of 
leukocytes: monocytes, lymphocytes, polymorphonuclear granulocytes <consisting of neutrophils, 
eosinophils, and basophils) and plasma cells. Additionally, fragments of megakaryocytes, a seventh 

3 o type of white blood cell in the bone marrow, occur in large numbers in the blood as platelets. 

Leukocytes are formed from two stem cell lineages in bone marrow. The myeloid stem cell 
line produces granulocytes and monocytes and, the lymphoid stem cell produces lymphocytes. 
Lymphoid cells travel to the thymus, spleen and lymph nodes, where they mature and tlifferentiate 
into lymphocytes. Leukocytes are responsible for defending the body against invading pathogens. 
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Neutrophils and monocytes attack invading bacteria, viruses, and other pathogens and destroy them 

i 

by phagocytosis. Monocytes enter tissues and differentiate into macrophages which are extremely 
phagocytic. Lymphocytes and plasma cells are a part of the immune system which recognizes 
specific foreign molecules and organisms and inactivates them, as well as signals other cells to attack 
5 the invaders. 

Granulocytes and monocytes are formed and stored in the bone marrow until needed. 
Megakaryocytes are produced in bone marrow, where they fragment into platelets and are released 
into the bloodstream. The main function of platelets is to activate the blood clotting mechanism. 
Lymphocytes and plasma cells are produced in various lymphogenous organs, including the lymph 
10 nodes, spleen, thymus, and tonsils. 

Both neutrophils and macrophages exhibit chemotaxis towards sites of inflammation. Tissue 
inflammation in response to pathogen invasion results in production of chemo-attractants for 
leukocytes, such as endotoxins or other bacterial products, prostaglandins, and products of leukocytes 
or platelets. 

as Basophils participate in the release of the chemicals involved in the inflammatory process. 

The main function of basophils is secretion of these chemicals to such a degree that thfey have been 
referred to as "unicellular endocrine glands." A distinct aspect of basophilic secretion is (hat the 
contents of granules go directly into the extracellular environment, not into vacuoles as occurs with 
neutrophils, eosinophils and monocytes. Basophils have receptors for the Fc fragment of 

20 immunoglobulin E (IgE) that are not present on other leukocytes. Crosslinking of membrane IgE 
with anti-IgE or other ligands triggers degranulation. 

Eosinophils are bi- or multi-nucleated white blood cells which contain eosinophilic granules. 
Their plasma membrane is characterized by Ig receptors, particularly IgG and IgE. Generally, 
eosinophils are stored in the bone marrow until recruited for use at a site of inflammation or invasion. 

2 5 They have specific functions in parasitic infections and allergic reactions, and are thought to detoxify 

some of the substances released by mast cells and basophils which cause inflammation. Additionally, 
they phagocytize antigen-antibody complexes and further help prevent spread of the inflammation. 

Macrophages are monocytes that have left the blood stream to settle in tissue. Once 
monocytes have migrated into tissues, they do not re-enter the bloodstream. The mononuclear 

3 o phagocyte system is comprised of precursor cells in the bone marrow, monocytes in circulation, and 

macrophages in tissues. The system is capable of very fast and extensive phagocytosis. A 
macrophage may phagocytize over 100 bacteria, digest them and extrude residues, and then survive 
for many more months. Macrophages are also capable of ingesting large particles, including red 
blood cells and malarial parasites. They increase several-fold in size and transform into macrophages 
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that are characteristic of the tissue they have entered, surviving in tissues for several months. 

Mononuclear phagocytes are essential in defending the body against invasion by foreign 
pathogens, particularly intracellular microorganisms such as M. tuberculosis , listeria, leishmania and 
toxoplasma. Macrophages can also control the growth of tumorous ceils, via both phagocytosis and 
5 secretion of hydrolytic enzymes. Another important function of macrophages is that of processing 
antigen and presenting them in a biochemically modified form to lymphocytes. 

The immune system responds to invading microorganisms in two major ways: antibody 
production and cell mediated responses. Antibodies are immunoglobulin proteins produced by 
B-lymphocytes which bind to specific antigens and cause inactivation or promote destruction of the 

10 antigen by other cells. Cell-mediated immune responses involve T-lymphocytes (T cells) that react 
with foreign antigen on the surface of infected host cells. Depending on the type of T cell, the 
infected cell is either killed or signals are secreted which activate macrophages and other cells to 
destroy the infected cell (Paul, supra) . 

T-lymphocytes originate in the bone marrow or liver in fetuses. Precursor cells migrate via 

15 the blood to the thymus, where they are processed to mature into T-lymphocytes. This processing is 
crucial because of positive and negative selection of T cells that will react with foreign antigen and 
not with self molecules. After processing, T cells continuously circulate in the blood and secondary 
lymphoid tissues, such as lymph nodes, spleen, certain epithelium-associated tissues in the 
gastrointestinal tract, respiratory tract and skin. When T-lymphocytes are presented with the 

20 complementary antigen, they are stimulated to proliferate and release large numbers of activated T 
cells into the lymph system and the blood system. Hiese activated T cells can survive and circulate 
for several days. At the same time, T memory cells are created, which remain in the lymphoid tissue 
for months or years. Upon subsequent exposure to that specific antigen, these memory cells will 
respond more rapidly and with a stronger response than induced by the original antigen. This creates 

25 an "immunological memory" that can provide immunity for years. 

There are two major types of T cells: cytotoxic T cells destroy infected host cells, and helper 
T cells activate other white blood cells via chemical signals. One class of helper cell, T H 1, activates 
macrophages to destroy ingested microorganisms, while another, T H 2, stimulates the production of 
antibodies by B cells. 

3 o Cytotoxic T cells directly attack the infected target cell. In virus-infected cells, peptides 

derived from viral proteins are generated by the proteasome. These peptides are transported into the 
ER by the transporter associated with antigen processing (TAP) <Pamer, E. and P. Cresswell <1998) 
Annu. Rev. Immunol. 16:323-358). Once inside the ER, the peptides bind MHC I chains, and the 
peptide/MHC I complex is transported to the cell surface. Receptors on the surface of T cells bind to 
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antigen presented on cell surface MHC molecules. Once activated by binding to antigen, T cells 
secrete y -interferon, a signal molecule that induces the expression of genes necessary for presenting 
viral (or other) antigens to cytotoxic T cells. Cytotoxic T cells kill the infected cell by stimulating 
programmed cell death. 

5 Helper T cells constitute up to 75% of the total T cell population. Hiey regulate the immune 

functions by producing a variety of lymphokines that act on other cells in the immune system and on 
bone marrow. Among these lymphokines are: interleukins-2,3,4,5,6; granulocyte-monocyte colony 
stimulating factor, and v-interferon. 

Helper T cells are required for most B cells to respond to antigen. When an activated helper 

10 cell contacts a B cell, its centrosome and Golgi apparatus become oriented toward the B cell, aiding 
the directing of signal molecules, such as transmembrane-bound protein called CD40 ligand, onto the 
B cell surface to interact with the CD40 transmembrane protein. Secreted signals also help B cells to 
proliferate and mature and, in some cases, to switch the class of antibody being produced. 

B-lymphocytes (B cells) produce antibodies which react with specific antigenic proteins 

15 ( presented by pathogens. Once activated, B cells become filled with extensive rough endoplasmic 
reticulum and are known as plasma cells. As with T cells, interaction of B cells with antigen 
stimulates proliferation of only those B cells which produce antibody specific to that antigen. There 
are five classes of antibodies, known as immunoglobulins, which together comprise about 20% of 
total plasma protein. Each class mediates a characteristic biological response after antigen binding. 

20 Upon activation by specific antigen B cells switch from making membrane-bound antibody to 
secretion of that antibody. 

Antibodies, or immunoglobulins (Ig), are the founding members of the Ig superfamily and the 
central components of the humoral immune response. Antibodies are either expressed on the surface 
of B cells or secreted by B cells into the circulation. Antibodies bind and neutralize blood-borne 

2 5 foreign antigens. Hie prototypical antibody is a tetr amear consisting of two identical heavy 

polypeptide chains (H-chains) and two identical light polypeptide chains (L-chains) interlinked by 
disulfide bonds. This arrangement confers the characteristic Y-shape to antibody molecules. 
Antibodies are classified based on their H-chain composition. The five antibody classes, IgA, IgD, 
IgE, IgG and IgM, are defined by the a, 6, e, y, and |i H-chain types. There are two types of L- 

30 chains, k and A, either of which may associate as a pair with any H-chain pair. IgG, the most 

common class of antibody found in the circulation, is tetrameric, while the other classes of antibodies 
are generally variants or multimers of this basic structure. 

H-chains and L-chains each contain an N-terminal variable region and a C -terminal constant 
region. Both H-chains and L-chains contain repeated Ig domains. For example, a typical H-chain 
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contains four Ig domains, three of which occur within the constant region and one of which occurs 
within the variable region and contributes to the formation of the antigen recognition site. Likewise, 
a typical L-chain contains two Ig domains, one of which occurs within the constant region and one of 
which occurs within the variable region. In addition, H chains such as ji have been shown to 
5 associate with other polypeptides during differentiation of the B cell. 

Antibodies can be described in terms of their two main functional domains. Antigen 
recognition is mediated by the Fab (antigen binding fragment) region of the antibody, while effector 
functions are mediated by the Fc (crystallizable fragment) region. Binding of antibody to an antigen, 
such as a bacterium, triggers the destruction of the antigen by phagocytic white blood cells such as 

10 macrophages and neutrophils. These cells express surface receptors that specifically bind to the 

antibody Fc region and allow the phagocytic cells to engulf, ingest, and degrade the antibody-bound 
antigen. The Fc receptors expressed by phagocytic cells are single-pass transmembrane glycoproteins 
of about 300 to 400 amino acids (Sears, D.W. et al. (1990) J. Immunol. 144:371-378). The 

, extracellular portion of the Fc receptor typically contains two or three Ig domains. 

1 5 Diseases which cause over- or under-abundance of any one type of leukocyte usually result in 

the entire immune defense system becoming involved. A well-known autoimmune disease is AIDS 
(Acquired Immunodeficiency Syndrome) where the number of helper T cells is depleted, leaving the 
patient susceptible to infection by microorganisms and parasites. Another widespread medical 
condition attributable to the immune system is that of allergic reactions to certain antigens. Allergic 

20 reactions include: hay fever, asthma, anaphylaxis, and urticaria (hives). Leukemias are an excess 

production of white blood cells, to the point where a major portion of the body' s metabolic resources 
are directed solely at proliferation of white blood cells, leaving other tissues to starve. Leukopenia or 
agranulocytosis occurs when the bone marrow stops producing white blood cells. This leaves the 
body unprotected against foreign microorganisms, including those which normally inhabit skin, 

2 5 mucous membranes, and gastrointestinal tract. If all white blood cell production stops completely, 

infection will occur within two days and death may follow only 1 to 4 days latex. 

Impaired phagocytosis occurs in several diseases, including monocytic leukemia, systemic • 
lupus, and granulomatous disease. In such a situation, macrophages can phagocytize normally, but 
the enveloped organism is not killed. A defect in the plasma membrane enzyme which converts 

3 o oxygen to lethally reactive forms results in abscess formation in liver, lungs, spleen, lymph nodes, 

and beneath the skin. Eosinophilia is an excess of eosinophils commonly observed in patients with 
allergies (hay fever, asthma), allergic reactions to drugs, rheumatoid arthritis, and cancers (Hodgkin's 
disease, lung, and liver cancer) (Isselbacher, KJ. et al. (1994) Harrison's Principles of Internal 
Medicine . McGraw-Hill, Inc., New York NY). 
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Host defense is further augmented by the complement system. The complement system 
serves as an effector system and is involved in infectious agent recognition. It can function as an 
independent immune network or in conjunction with other humoral immune responses. The 
complement system is comprised of numerous plasma and membrane proteins that act in a cascade of 
5 reaction sequences whereby one component activates the next Hie result is a rapid and amplified 
response to infection through either an inflammatory response or increased phagocytosis. 

The complement system has more than 30 protein components which can be divided into 
functional groupings including modified serine proteases, membrane-binding proteins and regulators 
of complement activation. Activation occurs through two different pathways the classical and the 
10 alternative. Both pathways serve to destroy infectious agents through distinct triggering mechanisms 
that eventually merge with the involvement of the component C3. 

Hie classical pathway requires antibody binding to infectious agent antigens. Hie antibodies 
serve to define the target and initiate the complement system cascade, culminating in, the destruction 
of the infectious agent In this pathway, since the antibody guides initiation of the process, the 
15 complement can be seen as an effector arm of the humoral immune system. 

The alternative pathway of the complement system does not require the presence of pre- 
existing antibodies for targeting infectious agent destruction. Rather, this pathway, through low 
levels of an activated component, remains constantly primed and provides surveillance in the non- 
immune host to eriable targeting and, destruction of infectious agents. In this case foreign material 
20 triggers the cascade, thereby facilitating phagocytosis or lysis (Paul, supra , pp.918-919). 

Another important component of host defense is the process of inflammation. Inflammatory 
responses are divided into four categories on the basis of pathology and include allergic 
inflammation, cytotoxic antibody mediated inflammation, immune complex mediated inflammation 
and monocyte mediated inflammation. Inflammation manifests as a combination of each of these 
2 5 forms with one predominating. 

Allergic acute inflammation is observed in individuals wherein specific antigens stimulate 
IgE antibody production. Mast cells and basophils are subsequently activated by the attachment of 
antigen-IgE complexes, resulting in the release of cytoplasmic granule contents such as histamine. 
The products of activated mast cells can increase vascular permeability and constrict the smooth 
30 muscle of breathing passages, resulting in anaphylaxis or asthma. Acute inflammation is also 
mediated by cytotoxic antibodies and can result in the destruction of tissue through the binding of 
complement-fixing antibodies to cells. The responsible antibodies are of the IgG or IgM types. 
Resultant clinical disorders include autoimmune hemolytic anemia and thrombocytopenia as 
associated with systemic lupus erythematosis. 
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Immune complex mediated acute inflammation involves the IgG or IgM antibody types 
which combine with antigen to activate the complement cascade. When such immune complexes 
bind to neutrophils and macrophages they activate the respiratory burst to form protein- and vessel- 
damaging agents such as hydrogen peroxide, hydroxyl radical, hypocMorous acid, and cMoramines. 
5 Clinical manifestations include rheumatoid arthritis and systemic lupus erythematosus: 

In chronic inflammation or delayed-type hypersensitivity, macrophages are activated and 
process antigen for presentation to T cells that subsequently produce lymphokines and monokines. 

This type of inflammatory response is likely important for defense against intracellular parasites and 

i 

certain viruses. Clinical associations include, granulomatous disease, tuberculosis, leprosy, and 
10 sarcoidosis (Paul, W.E., supra, pp.1017-1018). 

Extracellular Information Transmission Molecules 

Intercellular communication is essential for the growth and survival of multicellular 
organisms, and in particular, for the function of the endocrine, nervous, and immune systems. In 

15 addition, intercellular communication is critical for developmental processes such as tissue 

construction and organogenesis, in which cell proliferation, cell differentiation, and morphogenesis 
must be spatially and temporally regulated in a precise and coordinated manner. Cells communicate 
with one another through the secretion and uptake of diverse types of signaling molecules such as 
hormones, growth factors, neuropeptides, and cytokines. 

20 Hormones 

Hormones are signaling molecules that coordinately regulate basic physiological processes 
from embryogenesis throughout adulthood. These processes include metabolism, respiration, 
reproduction, excretion, fetal tissue differentiation and organogenesis, growth and development, 
homeostasis, and the stress response. Hormonal secretions and the nervous system are tightly 

2 5 integrated and interdependent Hormones are secreted by endocrine glands, primarily the hypothalamus 

and pituitary, the thyroid and parathyroid, the pancreas, the adrenal glands, and the ovaries and testes. 

The secretion of hormones into the circulation is tightly controlled. Hormones are often 
secreted in diurnal, pulsatile, and cyclic patterns. Hormone secretion is regulated by perturbations in 
blood biochemistry, by other upstream-acting hormones, by neural impulses, and by negative feedback 

3 o loops. Blood hormone concentrations are constantly monitored and adjusted to maintain optimal, 

steady-state levels. Once secreted, hormones act only on those target cells that -express specific 
receptors. 

Most disorders of the endocrine system are caused by either hyposecretion or hypersecretion of 
hormones. Hyposecretion often occurs when a hormone's gland of origin is damaged or otherwise 
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impaired Hypo-secretion often results from the proliferation of tumors derived from hormone-secreting 
cells. Inappropriate hormone levels may also be caused by defects in regulatory feedback loops or in 
the processing of hormone precursors. Endocrine malfunction may also occur when the target cell fails 
to respond to the hormone. 
5 Hormones can be classified biochemically as polypeptides, steroids, eicosanoids, or amines. 

Polypeptides, which include diverse hormones such as insulin and growth hormone, vary in size and 1 
function and are often synthesized as inactive precursors that are processed intracellularly into mature, 
active forms. Amines, which include epinephrine and dopamine, are amino add derivatives that 
function in neuroendocrine signaling. Steroids, which include the cholesterol-derived hormones 
10 estrogen and testosterone, function in sexual development and reproduction. Eicosanoids, which 
include prostaglandins and prostacyclins, are fatty acid derivatives that function in a variety of 
processes. Most polypeptides and some amines are soluble in the circulation where they are highly 
susceptible to proteolytic degradation within seconds after their secretion. Steroids and lipids are 1 
insoluble and must be transported in the circulation by carrier proteins. The following discussion will 

i 

15 ( focus primarily on polypeptide hormones. 

Hormones secreted by the hypothalamus and pituitary gland play a critical role in endocrine 
function by coordinately regulating hormonal secretions from other endocrine glands in response to 
neural signals. Hypothalamic hormones include thyrotropin-releasing hormone, gonadotropin-reieasing 
hormone, somatostatin, growth-hormone releasing factor, corticotropin-releasing hormone, substance P, 

2 o dopamine, and prolactin-releasing hormone. These hormones directly regulate the secretion of 

hormones from the anterior lobe of the pituitary. Hormones secreted by the anterior pituitary include 
adrenocorticotropic hormone (ACTH), mdanocyte-stimulating hormone, somatotropic hormones such 
as growth hormone and prolactin, glycoprotein hormones such as thyroid-stimulating hormone, 
luteinizing hormone (LH), and follicle-stimulating hormone (FSH), P-lipotropin, and P-endorphins. 

2 5 These hormones regulate hormonal secretions from the thyroid, pancreas, and adrenal glands, and act 
directly on the reproductive organs to stimulate ovulation and spermatogenesis. The posterior pituitary 
synthesizes and secretes antidiuretic hormone (ADH, vasopressin) and oxytocin. 

Disorders of the hypothalamus and pituitary often result from lesions such as primary brain 
tumors, adenomas, infarction associated with pregnancy, hypophysectomy, aneurysms, vascular 

30 malformations, thrombosis, infections, immunological disorders, and complications due to head trauma. 
Such disorders have profound effects on the function of other endocrine glands. Disorders associated 
with hypopituitarism include hypogonadism, Sheehan syndrome, diabetes insipidus, Kallman's disease, 
Hand-Schuller-Christian disease, Letterer-Siwe disease, sarcoidosis, empty sella syndrome, and 
dwarfism. Disorders associated with hypopituitarism include acromegaly, giantism, and syndrome of 
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inappropriate ADH secretion (SIADH), often caused by benign adenomas. 

Hormones secreted by the thyroid and parathyroid primarily control metabolic rates and the 
regulation of serum calcium levels, respectively. Thyroid hormones include calcitonin, somatostatin, 
and thyroid hormone. The parathyroid secretes parathyroid hormone. Disorders associated with 
5 hypothyroidism include goiter, myxedema, acute thyroiditis associated with bacterial infection, 

subacute thyroiditis associated with viral infection, autoimmune thyroiditis (Hashimoto's disease), and 
cretinism. Disorders associated with hyperthyroidism include thyrotoxicosis and its various forms, 
Grave's disease, pretibial myxedema, toxic multinodular goiter, thyroid carcinoma, and Plummer's 
disease. Disorders associated with hyperparathyroidism include Conn disease (chronic hypercalemia) 

10 leading to bone resorption and parathyroid hyperplasia. 

Hormones secreted by the pancreas regulate blood glucose levels by modulating the rates of 
carbohydrate, fat, and protein metabolism. Pancreatic hormones include insulin, glucagon, amylin, y- 
aminobutyric acid, gastrin, somatostatin, and pancreatic polypeptide. The principal disorder associated 
with pancreatic dysfunction is diabetes mellitus caused by insufficient insulin activity. Diabetes 

15 mellitus is generally classified as either Type I (insulin-dependent, juvenile diabetes) or Type II (non- 
insulin-dependent, adult diabetes). The treatment of both forms by insulin replacement therapy is well 
known. Diabetes mellitus often leads to acute complications such as hypoglycemia (insulin shock), 
coma, diabetic ketoacidosis, lactic acidosis, and chronic complications leading to disorders of the eye, 
kidney, skin, bone, joint, cardiovascular system, nervous system, and to decreased resistance to 

20 infection. 

The anatomy, physiology, and diseases related to hormonal function are reviewed in McCance, 
KX. and S.E. Huether (1994) Pathophysiology: The Biological Basis for Disease in Adults and 
Children . Mosby-Year Book, Inc., St Louis MO; Greenspan, F.S. and J.D. Baxter (1994) Basic and 
Clinical Endocrinology . Appleton and Lange, East Norwalk CT. 
25 Growth Factors 

Growth factors are secreted proteins that mediate intercellular communication. Unlike 
hormones, which travel great distances via the circulatory system, most growth factors are primarily 
local mediators that act on neighboring cells. Most growth factors contain a hydrophobic N-terminal 
signal peptide sequence which directs the growth factor into the secretory pathway. Most growth 
30 factors also undergo post-translational modifications within the secretory pathway. These 

modifications can include proteolysis, glycosylation, phosphorylation, and intramolecular disulfide bond 
formation. Once secreted, growth factors bind to specific receptors on the surfaces of neighboring 
target cells, and the bound receptors trigger intracellular signal transduction pathways. These signal 
transduction pathways elicit specific cellular responses in the target cefls. These responses can include 
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the modulation of gene expression and the stimulation or inhibition of cell division, cell differentiation, 
and cell motility. 

Growth factors fall into at least two broad and overlapping classes. The broadest class 
includes the large polypeptide growth factors, which are wide-ranging in their effects. These factors 
5 include epidermal growth factor (EGF), fibroblast growth factor (FGF), transforming growth factor-P 
(TGF-P), insulin-like growth factor (IGF), nerve growth factor (NGF), and platelet-derived growth 
factor (PDGF), each defining a family of numerous related factors. The large polypeptide growth 
factors, with the exception of NGF, act as mitogens on diverse cell types to stimulate wound healing, 

bone synthesis and remodeling, extracellular matrix synthesis, and proliferation of epithelial, , 

i 

10 epidermal, and connective tissues. Members of the TGF-P, EGF, and FGF families also function as 
inductive signals in the differentiation of embryonic tissue. NGF functions specifically as a 
neurotrophic factor, promoting neuronal growth and differentiation. 

Another class of growth factors includes the hematopoietic growth factors, which are narrow in 
their target specificity. These factors stimulate the proliferation and differentiation of blood cells such 

15 as B-lymphocytes, T-lymphocytes, erythrocytes, platelets, eosinophils, basophils, neutrophils, 

macrophages, and their stem cell precursors. These factors include the colony-stimulating factors (G- 
CSF, M-CSF, GM-CSF, and CSF1-3), erythropoietin, and the cytokines. The cytokines are specialized 
hematopoietic factors secreted by cells of the immune system and are discussed in detail below. 

Growth factors play critical roles in neoplastic transformation of cells in vitro and in tumor 

20 progression in vivo . Overexpression of the large polypeptide growth factors promotes the 

proliferation and transformation of cells in culture. Inappropriate expression of these growth factors 
by tumor cells in vivo may contribute to tumor vascularization and metastasis. Inappropriate activity 
of hematopoietic growth factors can result in anemias, leukemias, and lymphomas. Moreover, growth 
factors are both structurally and functionally related to oncoproteins, the potentially cancer-causing 

2 5 products of proto-oncogenes. Certain FGF and PDGF family members are themselves homologous to 

oncoproteins, whereas receptors for some members of the EGF, NGF, and FGF families are encoded 
by proto-oncogenes. Growth factors also affect the transcriptional regulation of both proto-oncogenes 
and oncosuppressor genes (Pimentel, E. (1994) Handbook of Growth Factors . CRC Press, Ann Arbor 
MI; McKay, I. and I. Leigh, eds. (1993) Growth Factors: A Practical Approach , Oxford University 

3 o Press, New York NY; Habenicht, A., ed. (1990) Growth Factors. Differentiation Factors, and 

Cytokines . Springer- Verlag, New York NY). 

In addition, some of the large polypeptide growth factors play crucial roles in the induction of 
the primordial germ layers in the developing embryo. This induction ultimately results in the formation 
of the embryonic mesoderm, ectoderm, and endoderm which in turn provide the framework for the 
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entire adult body plan. Disruption of this inductive process' would be catastrophic to embryonic 
development. 

Small Peptide Factors - Neuropeptides and Vasomediators 

Neuropeptides and vasomediators (NP/VM) comprise a family of small peptide factors, 
5 typically of 20 amino acids or less. These factors generally function in neuronal excitation and 
inhibition of vasoconstriction/vasodilation, muscle contraction, and hormonal secretions from the 
brain and other endocrine tissues. Included in this family are neuropeptides and neuropeptide 
hormones such as bombesin, neuropeptide Y, neurotensin, neuromedin N, melanocortins, opioids, 
galanin, somatostatin, tachykinins, urotensin n and related peptides involved in smooth muscle 
10 stimulation, vasopressin, vasoactive intestinal peptide, and circulatory system-borne signaling 

molecules such as angiotensin, complement, calcitonin, endothelins, formyl-methionyl peptides, 

i 

glucagon, cholecystokinin, gastrin, and many of the peptide hormones discussed above. NP/VMs can 
transduce signals directly, modulate the activity or release of other neurotransmitters and hormones, and 

1 act as catalytic enzymes in signaling cascades. The effects of NP/VMs range from extremely brief to 
15 long-lasting, (Reviewed in Martin, C.R. et al. (1985) Endocrine Physiology, Oxford University Press, 

New York NY, pp. 57-62.) 
Cytokines 

Cytokines comprise a family of signaling molecules that modulate the immune system and the - 
inflammatory response. Cytokines are usually secreted by leukocytes, or white blood cells, in response 
20 to injury or infection. Cytokines function as growth and differentiation factors that act primarily on 
cells of the immune system such as B- and T-lymphocytes, monocytes, macrophages, and granulocytes. 
Like other signaling molecules, cytokines bind to specific plasma membrane receptors and trigger 
intracellular signal transduction pathways which alter gene expression patterns. There is considerable 
potential for the use of cytokines in the treatment of inflammation and immune system disorders. 

2 5 Cytokine structure and function have been extensively characterized in vitro . Most cytokines 

are small polypeptides of about 30 kilodaltons or less. Over 50 cytokines have been identified from 
human and rodent sources. Examples of cytokine subfamilies include the interferons (IFN-a, -p, and - 
Y), the interleukins (EL1-IL13), the tumor necrosis factors (TNF-a and -p), and the chemokines. Many 
cytokines have been produced using recombinant DNA techniques, and the activities of individual 

3 o cytokines have been determined in vitro . These activities include regulation of leukocyte proliferation, 

differentiation, and motility. 

The activity of an individual cytokine in vitro may not reflect the full scope of that cytokine's 
activity in vivo . Cytokines are not expressed individually in vivo but are instead expressed in 
combination with a multitude of other cytokines when the organism is challenged with a stimulus. 
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Together, these cytokines collectively modulate the immune response in a manna* appropriate for that 
particular stimulus. Therefore, the physiological activity of a cytokine is determined by the stimulus 
itself and by .complex interactive networks among co-expressed cytokines which may demonstrate both 
synergistic and antagonistic relationships. 
5 Chemokines comprise a cytokine subfamily with over 30 members. (Reviewed in Wells, T. 

N.C. and M.C. Peitsch (1997) J. Leukoc. Biol. 61:545-550.) . Chemokines were initially identified as 
chemotactic proteins that recruit monocytes and macrophages to sites of inflammation. Recent evidence 
indicates that chemokines may also play key roles in hematopoiesis and HTV-1 infection. Chemokines 
are small proteins which range from about 6-15 kilodaltons in molecular weight. Chemokines are 

10 further classified as C, CC, CXC, or CX 3 C based on the number and position of critical cysteine 
residues. The CC chemokines, for example, each contain a conserved motif consisting of two 
consecutive cysteines followed by two additional cysteines which occur downstream at 24- and 16- 
residue intervals, respectively (ExPASy PROSITE database, documents PS00472 and,PDOC00434). 
The presence and spacing of these four cysteine residues are highly conserved, whereas the intervening 

15 residues diverge significantly. However, a conserved tyrosine located about 15 residues downstream of 
the cysteine doublet seems to be important for chemotactic activity. Most of the human genes encoding 
CC chemokines are clustered on chromosome 17, although there are a few examples of CC chemokine 
genes that map elsewhere. Other chemokines include lymphotactin (C chemokine); macrophage 
chemotactic and activating factor (MCAF/MCP-1; CC chemokine); platelet factor 4 and IL-8 (CXC 

20 chemokines); and fractalkine and neurotractin (CXgC chemokines). (Reviewed in Luster, A.D. (1998) 
N. Engl. J. Med. 338:436-445.) 

Receptor Molecules 

The term receptor describes proteins that specifically recognize other molecules. The category 
25 is broad and includes proteins with a variety of functions. The bulk of receptors are cell surface 
proteins which bind extracellular ligands and produce cellular responses in the areas of growth, 
differentiation, endocytosis, and immune response. Other receptors facilitate the selective transport of 
proteins out of the endoplasmic reticulum and localize enzymes to particular locations in the cell. Hie 
term may also be applied to proteins which act as receptors for ligands with known or unknown 
3 o chemical composition and which interact with other cellular components. For example, the steroid 
hormone receptors bind to and regulate transcription of DNA. 

Regulation of cell proliferation, differentiation, and migration is important for the formation 
and function of tissues. Regulatory proteins such as growth factors coordinately control these cellular 
processes and act as mediators in cell-cell signaling pathways. Growth factors are secreted proteins 
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that bind to specific cell-surface receptors on target cells. The bound receptors trigger intracellular 
signal transduction pathways which activate, various downstream effectors that regulate gene 
expression, cell division, cell differentiation, cell motility, and other cellular processes. 

Cell surface receptors are typically integral plasma membrane proteins. These receptors 
5 recognize hormones such as catecholamines; peptide hormones; growth and differentiation factors; 
small peptide factors such as thyrotropin-releasing hormone; galanin, somatostatin, and tachykinins; 
and circulatory system-borne signaling molecules. Cell surface receptors on immune system ceils 
recognize antigens, antibodies, and major histocompatibility complex (MHC)-bound peptides. Other 
cell surface receptors bind ligands to be internalized by the cell. This receptor-mediated endocytosis 

1 o functions in the uptake of low density lipoproteins (LDL), transferrin, glucose- or mannose-terminal 

glycoproteins, galactose-terminal gjycoproteins, immunoglobulins, phosphovitellogenins, fibrin, 
protrinase-inhibitor complexes, plasminogen activators, and thrombospondin (Lodish, H. et al. (1995) 
Molecular Cell Biology . Scientific American Books, New York NY, p. 723; Mikhailenko, I. et al. 
(1997) J. Biol. Chem. 272:6784-6791). 
15 Receptor Protein Kinases ^ 
Many growth factor receptors, including receptors for epidermal growth factor, 
platelet-derived growth factor, fibroblast growth factor, as well as the growth modulator a-thrombin, 
contain intrinsic prptein kinase activities. When growth factor binds to the receptor, it triggers the 
autophosphorylation of a serine, threonine, or tyrosine residue on the receptor. These phosphorylated 

2 o sites are recognition sites for the binding of other cytoplasmic signaling proteins. These proteins 

participate in signaling pathways that eventually link the initial receptor activation at the cell surface to 
the activation of a specific intracellular target molecule* In the case of tyrosine residue 
autophosphorylation, these signaling proteins contain a common domain referred to as a Src homology 
(SH) domain. SH2 domains and SH3 domains are found in phospholipase C-y, PI-3-K p85 regulatory 
25 subunit, Ras-GTPase activating protein, and ppeCT^ (Lowenstein, E.J. et al. (1992) Cell 70:431-442). 
The cytokine family of receptors share a different common binding domain and include transmembrane 
receptors for growth hormone (GH), interleukins, erythropoietin, and prolactin. 

Other receptors and second messenger-binding proteins have intrinsic serine/threonine protein 
kinase activity. These include activin/TGF-p/BMP-superfamily receptors, calcium- and diacylglycerol- 

3 o activated/phospholipid-dependant protein kinase (PK.-C), and RNA-dependant protein kinase (PK-R). 

In addition, other serine/threonine protein kinases, including nematode Twitchin, have fibronectin-like, 
immunoglobulin C2-like domains. 
G-Protein Coupled Receptors 

G-protein coupled receptors (GPCRs) are integral membrane proteins characterized by the 
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! presence of seven hydrophobic transmembrane domains which span the plasma membrane and form a 
bundle of antiparaHel alpha (a) helices. These proteins range in size from undo: 400 to over 1000 
amino acids (Strosberg, A.D. (1991) Eur. J. Biochem. 196:1-10; CougMin, S.R. (1994) Cuir. Opia 
Cell BioL 6:191-197). The ammo-terminus of the GPCR is extracellular, of variable length and often 
5 glycosylated; the carboxy-terminus is cytoplasmic and generally phosphorylated. Extracellular loops of 
the GPCR alternate with intracellular loops and link the transmembrane domains. The most conserved 
domains of GPCRs are the transmembrane domains and the first two cytoplasmic loops. The 
transmembrane domains account for structural and functional features of the receptor. In most cases, 
the bundle of a helices forms a binding pocket. In addition, the extracellular N-terminal segment or one 
10 or more of the three extracellular loops may also participate in ligand binding. Ligand binding activates 
the receptor by inducing a conformational change in intracellular portions of the receptor. The 
activated receptor, in turn, interacts with an intracellular heterotrimeric guanine nucleotide binding (G) 
protein complex which mediates further intracellular signaling activities, generally the production of 
second messengers such as cyclic AMP (cAMP), phospholipase C, inositol triphosphate, or interactions 

i 

15 . with ion channel proteins (Baldwin, J.M. (1994) Curr. Opia Cell Biol. 6:180-190). 

GPCRs include those for acetylcholine, adenosine, epinephrine and norepinephrine, bombesin,' 
bradykinin, chemokines, dopamine, endothdin, y-aminobutyric acid (GAB A), follicle-stimulating 
hormone (FSH), glutamate, gonadotropin-releasing hormone (GnRH), hepatocyte growth facto, 
histamine, leukotrienes, melanocortins, neuropeptide Y, opioid peptides, opsins, prostanoids, serotonin, 

2 o somatostatin, tachykinins, thrombin, thyrotropin-releasing hormone (TRH), vasoactive intestinal 

polypeptide family, vasopressin and oxytocin, and orphan receptors. 

GPCR mutations, which may cause loss of function or constitutive activation, have been 
associated with numerous human diseases (Coughlin, supra) . For instance, retinitis pigmentosa may 
arise from mutations in the rhodopsin gene. Rhodopsin is the retinal photoreceptor which is located 
25 within the discs of the eye rod cdl. Parma, J. et al. (1993, Nature 365:649-651) report that somatic 
activating mutations in the thyrotropin receptor cause hyperfunctioning thyroid adenomas and suggest 
that certain GPCRs susceptible to constitutive activation may behave as protooncogenes. 
Nuclear Receptors 

Nuclear receptors bind small molecules such as hormones or second messengers, leading to 

3 o increased receptor-binding affinity to specific chromosomal DNA dements. In addition the affinity for 

other nuclear proteins may also be altered. Such binding and protein-protein interactions may regulate 
and modulate gene expression. Examples of such receptors include the steroid hormone receptors 
family, the retinoic acid receptors family, and the thyroid hormone receptors f amfly. 
Ligand-Gated Receptor Ion Channels 
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Ligand-gated receptor ion channels fall into two categories. The first category, extracellular 
ligand-gated receptor ion channels (ELGs), rapidly transduce neurotransmitter-binding events into 
electrical signals, such as fast synaptic neurotransmission. ELG function is regulated by post- 
translational modification. The second category, intracellular ligand-gated receptor ion channels 
5 (ILGs), are activated by many intracellular second messengers and do not require post-translational 
modification(s) to effect a channel-opening response. 

ELGs depolarize excitable cells to the threshold of action potential generatioa In non-excitable 
cells, ELGs permit a limited calcium ion-influx during the presence of agonist. ELGs include channels 
directly gated by neurotransmitters such as acetylcholine, L-glutamate, glycine, ATP, serotonin, 

l o GAB A, and histamine. ELG genes encode proteins having strong structural and functional similarities. 
ILGs are encoded by distinct and unrelated gene families and include receptors for cAMP, cGMP, 
calcium ions, ATP, and metabolites of arachidonic add. 
Macrophage Scavenger Receptors 

Macrophage scavenger receptors with broad ligand specificity may participate in the binding of 

15 low density lipoproteins (LDL) and foreign antigens. Scavenger receptors types I and II are trimeric 
membrane proteins with each subunit containing a small N-terminal intracellular domain, a 
transmembrane domain, a large extracellular domain, and a C-terminal cysteinenrich domain: The 
extracellular domain contains a short spacer domain, an a-helical coiled-coil domain, and a triple helical 
collagenous domain. These receptors have been shown to bind a spectrum of ligands, including 

20 chemically modified lipoproteins and albumin, polyribonucleotides, polysaccharides, phospholipids, and 
asbestos (Matsumoto, A et al. (1990) Proc. Natl. Acad. Sci. USA 87:9133-9137; Elomaa, O. et al. 
(1995) Cell 80:603-609). The scavenger receptors are thought to play a key role in atherogenesis by 
mediating uptake of modified LDL in arterial walls, and in hostdefense by binding bacterial 
endotoxins, bacteria, and protozoa. 

25 T-Cell Receptors 

T cells play a dual role in the immune system as effectors and regulators, coupling antigen 
recognition with the transmission of signals that induce cell death in infected cells and stimulate 
proliferation of other immune cells. Although a population of T cells can recognize a wide range of 
different antigens, an individual T cell can only recognize a single antigen and only when it is presented 

30 to the T cell receptor (TCR) as a peptide complexed with a major histocompatibility molecule (MHC) 
on the surface of an antigen presenting cell. The TCR on most T cells consists of immunoglobulin-like 
integral membrane glycoproteins containing two polypeptide subunits, a and f3, of similar molecular 
weight. Both TCR subunits have an extracellular domain containing both variable and constant 
regions, a transmembrane domain that traverses the membrane once, and a short intracellular domain 
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(Saito, H. et al. (1984) Nature 309:757-762). The genes for the TCR subunits are constructed through 
somatic rearrangement of different gene segments. Interaction of antigen in the proper MHC context 
with the TCR initiates signaling cascades that induce the proliferation, maturation, and function of 
cellular components of the immune system (Weiss, A. (1991) Annu. Rev. Genet 25:487-510). 
5 Rearrangements in TCR genes and alterations in TCR expression have been noted in lymphomas, 
leukemias, autoimmune disorders, and immunodeficiency disorders (Aisenberg, A.C. et'al. (1985) N. 
Engl. J. Med 313:529-533; Weiss, su^a). , 

Intracellular Signaling Molecules 

i 

10 Intracellular signaling is the general process by which cells respond to extracellular signals 

(hormones, neurotransmitters, growth and differentiation factors, etc.) through a cascade of 
biochemical reactions that begins with the binding of a signaling molecule to a cell membrane 
receptor and ends with the activation of an intracellular target molecule. Intermediate steps in the 
process involve the activation of various cytoplasmic proteins by phosphorylation via protein kinases, 

15 and their deactivation by protein phosphatases, and the eventual translocation of some of these 
activated proteins to the cell nucleus where the transcription of specific genes is triggered. The 
intracellular signaling process regulates all types of cell functions including cell proliferation, cell 
differentiation, and gene transcription, and involves a diversity of molecules including protein kinases 
and phosphatases, "and second messenger molecules, such as cyclic nucleotides, calcium-calmodulin, 

2 o inositol, and various mitogens, that regulate protein phosphorylation. 

Protein Phosphorylation 

Protein kinases and phosphatases play a key role in the intracellular signaling process by 
controlling the phosphorylation and activation of various signaling proteins. The high energy 
phosphate for this reaction is generally transferred from the adenosine triphosphate molecule (ATP) to 
25 a particular protein by a protein kinase and removed from that protein by a protein phosphatase. 
Protein kinases are roughly divided into two groups: those that phosphorylate tyrosine residues 
(protein tyrosine kinases, PTK) and those that phosphorylate serine or threonine residues 
(serine/threonine kinases, STK). A few protein kinases have dual specificity for serine/threonine and 
tyrosine residues. Almost all kinases contain a conserved 250-300 amino acid catalytic domain 

3 o containing specific residues and sequence motifs characteristic of the kinase family (Hardie, G. and S. 

Hanks (1995) The Protein Kinase Facts Books . Vol 1:7-20, Academic Press, San Diego CA). 

STKs include the second messenger dependent protein kinases such as the cyclic- AMP 
dependent protein kinases (PKA), involved in mediating hormone-induced cellular responses; 
calcium-calmodulin (CaM) dependent protein kinases, involved in regulation of smooth muscle 
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contraction, glycogen breakdown, and neurotransmission; and the mitogen-activated protein kinases 
(MAP) which mediate signal transduction from the cell surface to the nucleus via phosphorylation 
cascades. Altered PKA expression is implicated in a variety of disorders and, diseases including 
cancer, thyroid disorders, diabetes, atherosclerosis, and cardiovascular disease (Isselbacher, K.J. et al. 
5 (1994) Harrison's Principles of Internal Medicine McGraw-Hill, New York NY, pp. 416-431, 1887). 

PTKs are divided into transmembrane, receptor PTKs ajid nontransmembrane, non-receptor 
PTKs. Transmembrane PTKs are receptors for most growth factors. Non-receptor PTKs lack 
transmembrane regions and, instead, form complexes with the intracellular regions of cell surface 
receptors. Receptors that function through non-receptor PTfcs include those for cytokines and 

10 hormones (growth hormone and prolactin) and antigen-specific receptors on T and B lymphocytes. 
Many of these PTKs were first identified as the products of mutant oncogenes in cancer cells in which 
their activation was no longer subject to normal cellular controls. In fact, about one third of the 
known oncogenes encode PTKs, and it is well known that cellular transformation (oncogenesis) is 
often accompanied by increased tyrosine phosphorylation activity (Charbonneau, H. and N.K. Tonks 

is (1992) Annu. Rev. Cell Biol. 8:463-493). 

An additional family of protein kinases previously thought to exist only in procaryotes is the • 
histidine protein kinase family (HPK). HPKs bear little homology with mammalian STKs or PTKs 
but have distinctive sequence motifs of their own (Davie, J.R. et al. (1995) J. Biol. Chem. ; 
270:19861-19867). A histidine residue in the N-terminal half of the molecule (region I) is an 

20 autophosphorylation site. Three additional motifs located in the C-terminal half of the molecule 
include an invariant asparagine residue in region II and two glycine-rich loops characteristic of 
nucleotide binding domains in regions HI and IV. Recently a branched chain alpha-ketoacid 
dehydrogenase kinase has been found with characteristics of HPK in rat (Davie, sunra) . 

Protein phosphatases regulate the effects of protein kinases by removing phosphate groups 

25 from molecules previously activated by kinases. The two principal categories of protein phosphatases 
are the protein (serine/threonine) phosphatases (PPs) and the protein tyrosine phosphatases (PTPs). 
PPs dephosphorylate phosphoserine/threonine residues and are important regulators of many 
cAMP-mediated hormone responses (Cohen, P. (1989) Annu. Rev. Biochem. 58:453-508). PTPs 
reverse the effects of protein tyrosine kinases and play a significant role in cell cycle and cell 

30 signaling processes (Charbonneau. supra) . As previously noted, many PTKs are encoded by 

oncogenes, and oncogenesis is often accompanied by increased tyrosine phosphorylation activity. It 
is therefore possible that PTPs may prevent or reverse cell transformation and the growth of various 
cancers by controlling the levels of tyrosine phosphorylation in cells. This hypothesis is supported by 
studies showing that overexpression of PTPs can suppress transformation in cells, and that specific 
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inhibition of PTPs can enhance cell transformation (Charbonneau, supra) . 
Phospholipid and Inositol-Phosphate Signaling ■ 

■ Inositol phospholipids (phosphoinositides) are involved in an intracellular signaling pathway 
that begins with binding of a signaling molecule to a G-protein linked receptor in the plasma 
5 membrane. This leads to the phosphorylation of phosphatidylinositol (PI) residues on the inner side 
of the plasma membrane to the biphosphate state (PIPj) by inositol kinases. Simultaneously, the G- 
protein linked receptor binding stimulates a trimeric G-protein which in turn activates a 
phosphoinositide-specific phospholipase C-p. Phospholipase C-P then cleaves PIP 2 into two 
products, inositol triphosphate (JP 3 ) and diacylglycerol. These two products act as mediators for 

10 separate signaling events. IP 3 diffuses through the plasma membrane to induce calcium release from 
the endoplasmic reticulum (ER), while diacylglycerol remains in the membrane and helps activate 
protein kinase C, an STK that phosphorylates selected proteins in the target cell. The calcium 
response initiated by IP 3 is terminated by the dephosphorylation of IP 3 by specific inositol 
phosphatases. Cellular responses that are mediated by this pathway are glycogen breakdown in the 

15 liver in response to vasopressin, smooth muscle contraction in response to acetylcholine, and 
thrombin-induced platelet aggregation. 
Cyclic Nucleotide Signaling 

Cyclic nucleotides (cAMP and cGMP) function as intracellular second messengers to • 
transduce a variety of extracellular signals including hormones, light, and neurotransmitters. In 

20 particular, cyclic-AMP dependent protein kinases (PKA) are thought to account for all of the effects 
of cAMP in most mammalian cells, including various hormone-induced cellular responses. Visual 
excitation and the phototransmission of light signals in the eye is controlled by cyclic-GMP regulated, 
Ca^-specific channels. Because of the importance of cellular levels of cyclic nucleotides in 
mediating these various responses, regulating the synthesis and breakdown of cyclic nucleotides is an 

2 5 important matter. Thus adenylyl cyclase, which synthesizes cAMP from AMP, is activated to 

increase cAMP levels in muscle by binding of adrenaline to P-andrenergic receptors, while activation 
of guanylate cyclase and increased cGMP levels in photoreceptors leads to reopening of the 
Ca 2+ -specific channels and recovery of the dark state in the eye. In contrast, hydrolysis of cyclic 
nucleotides by cAMP and cGMP-specific phosphodiesterases (PDEs) produces the opposite of these 

3 o and other effects mediated by increased cyclic nucleotide levels. PDEs appear to be particularly 

important in the regulation of cyclic nucleotides, considering the diversity found in this family of 
proteins. At least seven families of mammalian PDEs (PDE1-7) have been identified based on 
substrate specificity and affinity, sensitivity to cofactors, and sensitivity to inhibitory drugs (Beavo, 
J A. (1995) Physiological Reviews 75:725-748). PDE inhibitors have been found to be particularly 
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useful in treating various clinical disorders. Rolipram, a specific inhibitor of PDE4, has been used in 
the treatment of depression, and similar inhibitors are undergoing evaluation as anti-inflammatory 
agents. Theophylline is a nonspecific PDE inhibitor used in the treatment of bronchial asthma and 
other respiratory diseases (Banner, K.H. and CP. Page (1995) Eur. Respir. J. 8:996-1000). 
5 G-Protein Signaling 

Guanine nucleotide binding proteins (G-proteins) are critical mediators of signal transduction 
between a particular class of extracellular receptors, the G-protein coupled receptors (GPCR), and 
intracellular second messengers such as cAMP and Ca 2 \ G-proteins are linked to the cytosolic side 
of a GPCR such that activation of the GPCR by ligand binding stimulates binding of tlie G-protein to 

10 GXP, inducing an "active" state in the G-protein. In the active state, the G-protein acts as a signal to 
trigger other events in the cell such as the increase of cAMP levels or the release of Ca 2+ into the 
cytosol from the ER, which, in turn, regulate phosphorylation and activation of other intracellular 
proteins. Recycling of the G-protein to the inactive state involves hydrolysis of the bound GTP to 
GDP by a GTPase activity in the G-protein. (See Alberts, B. et al. (1994) Molecular B iology of the 

15 Cell . Garland Publishing, Inc., New York NY, pp.734-759.) Two structurally distinct classes of G- 
proteins are recognized: heterotrimeric G-proteins, consisting of three different subunits, and 
monomelic, low molecular weight (LMW), G-proteins consisting of a single polypeptide chain. 

The three polypeptide subunits of heterotrimeric G-proteins are the a, p, and y subunits. The 
a subunit binds and hydrolyzes GTP. The p and y subunits form a tight complex that anchors the 

20 protein to the inner side of the plasma membrane. The P subunits, also known as G-p proteins or p 
transducins, contain seven tandem repeats of the WD-repeat sequence motif, a motif found in many 
proteins with regulatory functions. Mutations and variant expression of P transducin proteins are 
linked with various disorders (Neer, EJ. et al. (1994) Nature 371:297-300; Margottin, F. et al. (1998) 
Mol. Cell 1:565-574). 

25 LMW GTP-proteins are GTPases which regulate cell growth, cell cycle control, protein 

secretion, and intracellular vesicle interaction. They consist of single polypeptides which, like the a 
subunit of the heterotrimeric G-proteins, are able to bind and hydrolyze GTP, thus cycling between an 
inactive and an active state. At least sixty members of the LMW G-protein superfamily have been 
identified and are currently grouped into the six subfamilies of ras, rho, arf, sari, ran, and rab. 

3 o Activated ras genes were initially found in human cancers, and subsequent studies confirmed that ras 
function is critical in determining whether cells continue to grow or become differentiated. Other 
members of the LMW G-protein superfamily have roles in signal transduction that vary with the 
function of the activated genes and the locations of the G-proteins. 

Guanine nucleotide exchange factors regulate the activities of LMW G-proteins by 
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detennining whether GTP or GDP is bound. GTPase- activating protein (GAP) binds to GTP-ras and 
induces it to hydrolyze GTP to GDP. In contrast, guanine nucleotide releasing protein (GNRP) binds 
to GDP-ras and induces the release of GDP and the binding of GTP. 

Other regulators of G-protein signaling (RGS) also exist that act primarily by negatively 
5 regulating the G-protein pathway by an unknown mechanism (Druey, K.M. et al. (1996) Nature 
379:742-746). Some 15 members of the RGS family have been identified. RGS family members are 
related structurally through similarities in an approximately 120 amino acid region termed the RGS 
domain and functionally by their ability to inhibit the interleukin (cytokine) induction of MAP kinase 
in cultured mammalian 293T cells (Druev: supraV 

10 Calcium Signaling Molecules 

Ca +2 is another second messenger molecule that is even more widely used as an intracellular 
mediator than cAMP. Two pathways exist by which Ca +2 can enter the cytosol in response to 
extracellular signals: One pathway acts primarily in nerve signal transduction where Ca +a enters a 
nerve terminal through a voltage-gated Ca +2 channel. The second is a more ubiquitous pathway in 

(15 which Ca +2 is released from the ER into the cytosol in response to binding of an extracellular 

signaling molecule to a receptor. Ca 2 * directly activates regulatory enzymes, such as protein kinase C, 
which trigger signal transduction pathways. Ca 2+ also binds to specific Ca^-binding proteins (CBPs) 
such as calmodulin (CaM) which then activate multiple target proteins in the cell including enzymes, 
membrane transport pumps, and ion channels. CaM interactions are involved in a multitude of cellular 

2 o processes including, but not limited to, gene regulation, DNA synthesis, cell cycle progression, 
mitosis, cytokinesis, cytoskeletal organization, muscle contraction, signal transduction, ion 
homeostasis, exocytosis, and metabolic regulation (Celio, M.R. et al. (1996) Guidebook to 
Calcium-binding Proteins . Oxford University Press, Oxford, UK, pp. 15-20). Some CBPs can serve 
as a storage depot for Ca 2+ in an inactive state. Calsequestrin is one such CBP that is expressed in 

2 5 isoforms specific to cardiac muscle and skeletal muscle. It is suggested that calsequestrin binds Ca 2+ 

in a rapidly exchangeable state that is released during Ca 2+ -signaling conditions (Celio, M.R. et al. 
(1996) Guidebook to Calcium-binding Proteins . Oxford University Press, New York NY, pp. 222- 
224). 
Cvclins 

3 o Cell division is the fundamental process by which all living things grow and reproduce. In 

most organisms, the cell cycle consists of three principle steps; interphase, mitosis, and cytokinesis. 
Interphase, involves preparations for cell division, replication of the DNA and production of essential 
proteins. In mitosis, the nuclear material is divided and separates to opposite sides of the cell. 
Cytokinesis is the final division and fission of the cell cytoplasm to produce the daughter cells. 
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The entry and exit of a cell from mitosis is regulated by the synthesis and destruction of a 
family of activating proteins called cyclins. Cyclins act by binding to and activating a group of 
cyclin-dependent protein kinases (Cdks) which then phosphorylate and activate selected proteins 
involved in the mitotic process. Several types of cyclins exist (Ciechanover, A. (1994) Cell 
5 79:13-21.) Two principle types are mitotic cyclin, or cyclin B, which controls entry of the cell into' 
mitosis, and Gl cyclin, which controls events that drive the ceil out of mitosis. 
Signal Complex Scaffolding Proteins 

Ceretain proteins in intracellular signaling pathways serve to link or cluster other proteins 
involved in the signaling cascade. A conserved protein domain called the PDZ domain has been 

l o identified in various membrane-associated signaling proteins. This domain has been implicated in 
receptor and ion channel clustering and in the targeting of multiprotein signaling complexes to 
specialized functional regions of the cytosolic face of the plasma membrane. (For a review of PDZ 
domain-containing proteins, see Ponting, CP. et d. (1997) Bioessays 19:469-479.) A large 
proportion of PDZ domains are found in the eukaryotic MAGUK (membrane-associated guanylate ; 

15 kinase) protein family, members of which bind to the intracellular domains of receptors and channels. 
However, PDZ domains are also found in diverse membrane-localized proteins such as protein 
tyrosine phosphatases, serine/threonine kinases, G-protein cof actors, and synapse-associated proteins 
such as syntrophins and neuronal nitric oxide synthase (nNOS). Generally, about one to three PDZ 
domains are found in a given protein, although up to nine PDZ domains have been identified in a 

20 single protein. 

Membrane Transport Molecules 

The plasma membrane acts as a barrier to most molecules. Transport between the cytoplasm 
and the extracellular environment, and between the cytoplasm and lumenal spaces of cellular 

25 organelles requires specific transport proteins. Each transport protein carries a particular class of 
molecule, such as ions, sugars, or amino acids, and often is specific to a certain molecular species of 
the class. A variety of human inherited diseases are caused by a mutation in a transport protein. For 
example, cystinuria is an inherited disease that results from the inability to transport cystine, the 
disulfide-linked dimer of cysteine, from the urine into the blood. Accumulation of cystine in the 

3 o urine leads to the formation of cystine stones in the kidneys. 

Transport proteins are multi-pass transmembrane proteins, which either actively transport 
molecules across the membrane or passively allow them to cross. Active transport involves 
directional pumping of a solute across the membrane, usually against an electrochemical gradient 
Active transport is tightly coupled to a source of metabolic energy, such as ATP hydrolysis or an 
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electrochemically favorable ion gradient Passive transport involves the movement of a solute down 
its electrochemical gradient Transport proteins can be further classified as either carrier proteins or 
channel proteins. Carrier proteins, which can function in active or passive transport, bind to a specific 
solute to be transported and undergo a conformational change which transfers the bound solute across 
5 the membrane. Channel proteins, which only function in passive transport, form hydrophilic pores 
across the membrane. When the pores open, specific solutes, such as inorganic ions, pass through the 
membrane and down the electrochemical gradient of the soiute. , 

Carrier proteins which transport a single solute from one side of the membrane to the other 
are called unipprters. In contrast, coupled transporters link the transfer of one solute with 

10 simultaneous or sequential transfer of a second solute, either in the same direction (symport) or in the 

» i 

opposite direction (antiport). For example/intestinal and kidney epithelium contains a variety of 

symporter systems driven by the sodium gradient that exists across the plasma membrane. Sodium 

i 

moves into the cell down its electrochemical gradient and brings the solute into the cell with it. The 
sodium gradient that provides the driving force for solute uptake is maintained by the ubiquitous 

15 Na7K* ATPase. Sodium-coupled transporters include the mammalian glucose transporter (SGLT1), 
iodide transporter (NIS), and multivitamin transporter (SMVT). All three transporters have twelve 
putative transmembrane segments, extracellular glycosylation sites, and cytoplasmically-oriented N- 
and C-termini. NIS plays a crucial role in the evaluation, diagnosis, and treatment of various thyroid 
pathologies because it is the molecular basis for radioiodide thyroid-imaging techniques and for 

2 o specific targeting of radioisotopes to the thyroid gland (Levy, O. et al. (1997) Proc. Natl. Acad. Sci. 
USA 94:5568-5573). SMVT is expressed in the intestinal mucosa, kidney, and placenta, and is 
implicated in the transport of the water-soluble vitamins, e.g., biotin and pantothenate (Prasad, P.D. et 
al. (1998) J. Biol. Chem. 273:7501-7506). 

Transporters play a major role in the regulation of pH, excretion of drugs, and the cellular 

2 5 K7Na + balance. Monocarboxylate anion transporters are proton-coupled symporters with a broad 

substrate specificity that includes L-lactate, pyruvate, and the ketone bodies acetate, acetoacetate, and 
beta-hydroxybutyrate. At least seven isoforms have been identified to date. The isoforms are predicted 
to have twelve transmembrane (TM) helical domains with a large intracellular loop between TM6 and 
TM7, and play a critical role in maintaining intracellular pH by removing the protons that are produced 

3 o stoichiometrically with lactate during glycolysis. The best characterized H(+)-monocarboxylate 

transporter is that of the erythrocyte membrane, which transports L-lactate and a wide range of other 
aliphatic monocarboxylates. Other cells possess H(+)-linked monocarboxylate transporters with 
differing substrate and inhibitor selectivities. In particular, cardiac muscle and tumor cells have 
transporters that differ in their values for certain substrates, including stereoselectivity for L- over 
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D-lactate, and in their sensitivity to inhibitors. There are Na(+)-monocarboxylate cotransportears on the 
luminal surface of intestinal and kidney epithelia, which allow the uptake of lactate, pyruvate, and 
ketone bodies in these tissues. In addition, there are specific and selective transporters for organic 
cations and organic anions in organs including the kidney, intestine and liver. Organic anion 
5 transporters are selective for hydrophobic, charged molecules with electron-attracting side groups. 
Organic cation transporters, such as the ammonium transporter, mediate the secretion of a variety of 
drugs and endogenous metabolites, and contribute to themaintenance of intercellular pH. (Poole, R.C. 
and A.P. Halestrap (1993) Am. J. Physio! 264:C761-C782; Price, N.T. et al. (1998) Biochem. J. 
329:321-328; and Martinelle, K. and I. Haggstrom (1993) J. Biotechnol. 30: 339-350.) 

1 o The largest and most diverse family of transport proteins known is the ATP-binding cassette 

(ABC) transporters. As a family, ABC transporters can transport substances that differ markedly in 
chemical structure and size, ranging from small molecules such as ions, sugars, amino acids, peptides, 
and phospholipids, to lipopeptides, large proteins, and complex hydrophobic drugs. ABC proteins 
consist of four modules: two nucleotide-binding domains (NBD), which hydrolyze ATP to supply the 
1 5 energy required for transport, and two membrane-spanning domains (MSD), each containing six 

putative transmembrane segments. These four modules may be encoded by a single gene, as is the case 
for the cystic fibrosis transmembrane regulator (CFTR), or by separate genes. When encoded by 
separate genes, each gene product contains a single NBD and MSD. These "half-molecules" form 
homo- and heterodimers, such as Tapl and Tap2, the endoplasmic reticulum-based major 

2 o histocompatibility (MHC) peptide transport system. Several genetic diseases are attributed to defects in 

ABC transporters, such as the following diseases and their corresponding proteins: cystic fibrosis 
(CFTR, anion channel), adrenoleukodystrophy (adrenoleukodystrqphy protein, ALDP), Zellweger 
syndrome (peroxisomal membrane protein-70, PMP70), and hyperinsulinemic hypoglycemia 
(sulfonylurea receptor, SUR). Overexpression of the multidrug resistance (MDR) protein, another 
25 ABC transporter, in human cancer cells makes the cells resistant to a variety of cytotoxic drugs used in 
chemotherapy (Taglight, D. and S. Michaelis (1998) Meth. Enzymol. 292:131-163). 

Transport of fatty acids across the plasma membrane can occur by diffusion, a high capacity, 
low affinity process. However, under normal physiological conditions a significant fraction of fatty 
acid transport appears to occur via a high aifinity, low capacity protein-mediated transport process. 

3 o Fatty acid transport protein (FATP), an integral membrane protein with four transmembrane segments, 

is expressed in tissues exhibiting high levels of plasma membrane fatty acid flux, such as muscle, heart, 
and adipose. Expression of FATP is upregulated in 3T3-L1 cells during adipose conversion, and 
expression in COS7 fibroblasts elevates uptake of long-chain fatty acids (Hui, T.Y. et al. (1998) J. 
Biol. Chem. 273:27420-27429). 
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Ion Channels 

The electrical potential of a cell is generated and maintained by controlling the movement of 
ions across the plasma membrane. The movement of ions requires ion channels, which form an ion- 
selective pore within the membrane. There are two basic types of ion channels, ion transporters and 
5 gated ion channels. Ion transporters utilize the energy obtained from ATP hydrolysis to actively 
transport an ion against the ion's concentration gradient Gated ion channels allow passive flow of ail 
ion down the ion's electrochemical gradient under restricted conditions. Together, these types of ion 
channels generate, maintain, and utilize an electrochemical gradient that is used in 1) electrical impulse 
conduction down the axon of a nerve cell, 2) transport of molecules into cells against concentration 

1 o gradients, 3) initiation of muscle contraction, and 4) endocrine cell secretioa 

Ion transporters generate and maintain the resting electrical potential of a celL Utilizing the 
energy derived from ATP hydrolysis, they transport ions against the ion' s concentration gradient. 
These transmembrane ATPases are divided into three families. The phosphorylated (P) class ion 
transporters, including Na + -K + ATPase, Ca^-ATPase, and H + -ATPase, are activated by a 
15 phosphorylation event. P-class ion transporters are responsible for maintaining resting potential 

distributions such that cytosolic concentrations of Na + and Ca 2 * are low and cytosolic concentration of 
K + is high. The vacuolar (V) class of ion transporters includes H + pumps on intracellular organelles, 
such as lysosomes and Golgi. V-class ion transporters are responsible for generating the low pH within 
the lumen of these organelles that is required for functioa The coupling factor (F) class consists of H* 

2 o pumps in the mitochondria. F-class ion transporters utilize a proton gradient to generate ATP from 

ADP and inorganic phosphate (Pj). 

The resting potential of the cell is utilized in many processes involving carrier proteins and 

gated ion channels. Carrier proteins utilize the resting potential to transport molecules into and out of 

the cell. Amino acid and glucose transport into many cells is linked to sodium ion co-transport 
2 5 (symport) so that the movement of Na + down an electrochemical gradient drives transport of the other 

molecule up a concentration gradient. Similarly, cardiac muscle links transfer of Ca 2 * out of the cell 

with transport of Na + into the cell (antiport). 

Ion channels share common structural and mechanistic themes. The channel consists of four or 

five subunits or protein monomers that are arranged like a barrel in the plasma membrane. Each 
30 subunit typically consists of six potential transmembrane segments (SI, S2, S3, S4, S5, and S6). Hie 

center of the barrel forms a pore lined by a-helices or p-strands. The side chains of the amino acid 

residues comprising the a-helices or p-strands establish the charge (cation or anion) selectivity of the 

channel. The degree of selectivity, or what specific ions are allowed to pass through the channel, 

depends on the diameter of the narrowest part of the pore. 
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Gated ion channels control ion flow by regulating the opening and closing of pores. These 
channels are categorized according to the manner of regulating the gating function. Mechanically-gated 
channels open pores in response to mechanical stress, voltage-gated channels open pores in response to 
changes in membrane potential, and ligand-gated channels open pores in the presence of a specific ion, 
5 nucleotide, or neurotransmitter, 
t Voltage-gated Na + and K + channels are necessary for the function of electrically excitable cells, 

such as nerve and muscle cells. Action potentials, which lead to neurotransmitter release and muscle 
contraction, arise from large, transient changes in the permeability of the membrane to Na + and K* ions. 
Depolarization of the membrane beyond the threshold level opens voltage-gated Na + channels. Sodium 

l o ions flow into the cell, further depolarizing the membrane and opening more voltage-gated Na + 

channels, which propagates the depolarization down the length of the cell. Depolarization also opens 
voltage-gated potassium channels. Consequently, potassium ions flow outward, which leads to 
repolarization of the membrane. Voltage-gated channels utilize charged residues in the fourth 
transmembrane segment (S4) to sense voltage change. The open state lasts only about 1 millisecond, at 

1 5 which time the channel spontaneously converts into an inactive state that cannot be opened irrespective 
of the membrane potential. Inactivation is mediated by the channel's N-terminus, which acts as a plug 
that closes the pore. The transition from an inactive to a closed state requires a return to resting 
potential. 

Voltage-gated Na+ channels are heterotrimeric complexes composed of a 260 kDa pore forming 
20 a subunit that associates with two smaller auxiliary subunits, 01 and 02. The 02 subunit is an integral 
membrane glycoprotein that contains an extracellular Ig domain, and its association with a and 01 
subunits correlates with increased functional expression of the channel, a change in its gating 
properties, and an increase in whole cell capacitance due to an increase in membrane surface area. 
(Isom, L.L. et al. (1995) Cell 83:433-442.) 
25 Voltage-gated Ca 2 * channels are involved in presynaptic neurotransmitter release, and heart 

and skeletal muscle contraction. The voltage-gated Ca 2+ channels from skeletal muscle (L-type) and 
brain (N-type) have been purified, and though their functions differ dramatically, they have similar 
subunit compositions. The channels are composed of three subunits. The a x subunit forms the 
membrane pore and voltage sensor, while the 0^8 and 0 subunits modulate the voltage-dependence, 
3 o gating properties, and the current amplitude of the channel. These subunits are encoded by at least six 
a x , one ctjS, and four 0 genes. A fourth subunit, y, has been identified in skeletal muscle. (Walker, D. 
et al. (1998) J. Biol. Chem. 273:2361-2367; and Jay, S.D. et al. (1990) Science 248:49(M92.) 

Chloride channels are necessary in endocrine secretion and in regulation of cytosolic and 
organdie pH. In secretory epithelial cells, CI* enters the cell across a basolateral membrane through an 
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Na + , KVCT cotransporter, accumulating in the cell above its electrochemical equilibrium concentratioa 
Secretion of CT from the apical surface* in response to hormonal stimulation, leads to flow of Na + and 
water into the secretory lumen. The cystic fibrosis transmembrane conductance regulator (CFTR) is a 
chloride channel encoded by the gene for cystic fibrosis, a common fatal genetic disorder in humans. 
5 Loss of CFTR junction decreases transepithelial water secretion and, as a result, the layers of mucus 
that coat the respiratory tree, pancreatic ducts, and intestine are dehydrated and difficult to clear. The 
resulting blockage of these sites leads to pancreatic insufficiency; ''meconium ileus", and devastating 
"chronic obstructive pulmonary disease" (Al-Awqati, Q. et al. (1992) J. Exp. Biol. 172:245-266). 

Many intracellular organelles contain H + -ATPase pumps that generate transmembrane pH and 

io electrochemical differences by moving protons from the cytosol to the organelle lumen. If the 
membrane of the organelle is permeable to other ions, then the electrochemical gradient can be 
abrogated without affecting the pH differential. In fact, removal of the electrochemical barrio: allows 
more H * to be pumped across the membrane, increasing the pH differential. CI * is the ,sole counterion 
of H + translocation in a number of organelles, including chromaffin granules, Golgi vesicles, 

15 lysosomes, and endosomes. Functions that require a low vacuolar pH include uptake of small 

molecules such as biogenic amines in chromaffin granules, processing of vacuolar constituents such as 
pro-hormones by proteolytic enzymes, and protein degradation in lysosomes (Al-Awqati, supra) . 

Ligand-gated channels open their pores when an extracellular or intracellular mediator binds to 
the channel. Newotf ansmitter-gated channels are channels that open when a neurotransmitter binds to 

2 o their extracellular domain- These channels exist in the postsynaptic membrane of nerve or muscle cells. 
There are two types of neurotransmitter-gated channels. Sodium channels open in response to 
excitatory neurotransmitters, such as acetylcholine, glutamate, and serotonin. This opening causes an 
influx of Na* and produces the initial localized depolarization that activates the voltage-gated channels 
and starts the action potential. Chloride channels open in response to inhibitory neurotransmitters, such 

25 as y-aminobutyric acid (GAB A) and glycine, leading to hyperpolarization of the membrane and the 
subsequent generation of an action potential. 

Ligand-gated channels can be regulated by intracellular second messengers. Calcium-activated 
K + channels are gated by internal calcium ions. In nerve cells, an influx of calcium during 
depolarization opens K + channels to modulate the magnitude of the action potential (Ishi, T.M. et al. 

30 (1997) Proc. Natl. Acad. Sci. USA 94:1 1651-1 1656). Cyclic nucleotide-gated (CNG) channels are 
gated by cytosolic cyclic nucleotides. The best examples of these are the cAMP-gated Na + channels 
involved in olfaction and the cGMP-gated cation channels involved in visioa Both systems involve 
ligand-mediated activation of a G-protein coupled receptor which then alters the level of cyclic 
nucleotide within the cell. 
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Ion channels are expressed in a number of tissues where they are implicated in a variety of 
processes. CNG channels, while abundantly expressed in photoreceptor and olfactory sensory cells, are 
also found in kidney, lung, pineal, retinal ganglion cells, testis, aorta, and brain. Calcium-activated K + 
channels may be responsible for the vasodilatory effects of bradykinin in the kidney and for shunting 
5 excess K + from brain capillary endothelial cells into the blood. They are also implicated in repolarizing 
granulocytes after agonist-stimulated depolarization (Ishi, supraV . Ion channels have been the target for 
many drug therapies. Neurotransmitter-gated channels have been targeted in therapies for treatment of . 
insomnia, anxiety, depression, and schizophrenia. Voltage-gated channels have been targeted in 
therapies for arrhythmia, ischemic stroke, head trauma, and neurodegenerative disease (Taylor, CP. 
10 and L.S. Narasimhan (1997) Adv. Pharmacol. 39:47-98). 
Disease Correlation 

The etiology of numerous human diseases and disorders can be attributed to defects in the 
transport of molecules across membranes. Defects in the trafficking of membrane-bound transporters 
and ion channels are associated with several disorders, e.g. cystic fibrosis, glucose-galactose 
1 5 malabsorption syndrome, hypercholesterolemia, von Gierke disease, and certain forms of diabetes 
mellitus. Single-gene defect diseases resulting in an inability to transport small molecules across 
membranes include, e.g., cystinuria, iminoglycinuria, Hartup disease, and Fanconi disease (vant HcdEf, 
W.G. (1996) Exp. Nephrol. 4:253-262; Talente, G.M. et aL (1994) Ann. Intern. Med. 120:218-226; 
and Chillon, M. et aL (1995) New Engl. J. Med. 332:1475-1480). 

20 

Protein Modification and Maintenance Molecules 

The cellular processes regulating modification and maintenance of protein molecules 
coordinate their conformation, stabilization, and degradation. Each of these processes is mediated by 
key enzymes or proteins such as proteases, protease inhibitors, transferases, isomerases, and 

2 5 molecular chaperones. 

Proteases 

Proteases cleave proteins and peptides at the peptide bond that forms the backbone of the 
peptide and protein chain. Proteolytic processing is essential to cell growth, differentiation, 
remodeling, and homeostasis as well as inflammation and immune response. Typical protein half- 

3 o lives range from hours to a few days, so that within all living cells, precursor proteins are being 

cleaved to their active form, signal sequences proteolytically removed from targeted proteins, and 
aged or defective proteins degraded by proteolysis. Proteases function in bacterial, parasitic, and viral 
invasion and replication within a host. Four principal categories of mammalian proteases have been 
identified based on active site structure, mechanism of action, and overall three-dimensional structure. 
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(Beynon, RJ. and J.S. Bond (1994) Proteolytic Enzymes: A Practical Approach. Oxford University 
Press, New York NY, pp. 1-5). 

The, serine proteases (SPs) have a serine residue, usually within a conserved sequence, in an 
active site composed of the serine, an aspartate, and a histidine residue. SPs include the digestive 
5 enzymes trypsin and chymotrypsin, components of the complement cascade and the blood-clotting 
cascade, and enzymes that control extracellular protein degradation. The main SP subfamilies are 
trypases, which cleave after arginine or lysine; aspartases, which cleave after aspartate; chymases, 
which cleave after phenylalanine or leucine; metases, which cleavage after methionine; and serases 
which cleave after serine. Enterokinase, the initiator of intestinal digestion, is a serine protease, found 

10 in the intestinal brush border, where it cleaves the acidic propeptide from trypsinogen to yield active 
trypsin (Kitamoto, Y. et al. (1994) Proc. Natl. Acad. Sci. USA 91:7588-7592). 
Prolylcarboxypeptida^e, a lysosomal serine peptidase that cleaves peptides such as angiotensin II and 
HI and [des-Arg9] bradykinin, shares sequence homology with members of both the serine 
carboxypeptidase andprolylendopeptidase families (Tan, F. et al. (1993) J. Biol. Chem. 268:16631- 

15 16638). 

Cysteine proteases (CPs) have a cysteine as the major catalytic residue at an active site where 
catalysis proceeds via an intermediate thiol ester and is facilitated by adjacent histidine and aspartic 
acid residues. CPs are involved in diverse cellular processes ranging from the processing of precursor 
proteins to intracellular degradation. Mammalian CPs include lysosomal cathepsins and cytosolic 

20 calcium activated proteases, calpains. CPs are produced by monocytes, macrophages and other cells 
of the immune system which migrate to sites of inflammation and secrete molecules involved in 
tissue repair. Overabundance of these repair molecules plays a role in certain disorders. In 
autoimmune diseases such as rheumatoid arthritis, secretion of the cysteine peptidase cathepsin C 
degrades collagen, laminin, elastin and other structural proteins found in the extracellular matrix of 

25 bones. 

Aspartic proteases are members of the cathepsin family of lysosomal proteases and include 
pepsin A, gastricsin, chymosin, renin, and cathepsins D and E. Aspartic proteases have a pair of 
aspartic acid residues in the active site, and are most active in the pH 2 - 3 range, in which one of the 
aspartate residues is ionized, the other un-ionized. Aspartic proteases include bacterial 
3 o penicillopepsin, mammalian pepsin, renin, chymosin, and certain fungal proteases. Abnormal 
regulation and expression of cathepsins is evident in various inflammatory disease states. In cells 
isolated from inflamed synovia, the mRNA for stromdysin, cytokines, HMP-1 , cathepsin, gelatinase, 
and other molecules is preferentially expressed. Expression of cathepsins L and D is elevated in 
synovial tissues from patients with rheumatoid arthritis and osteoarthritis. Cathepsin L expression may 
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also contribute to the influx of mononuclear cells which exacerbates the destruction of the rheumatoid 
synovium. (Keyszer, G.M. (1995) Arthritis Rheum. 38:976-984.) The increased expression and 
differential regulation of the cathepsins are linked to the metastatic potential of a variety of cancers and 
as such are of therapeutic and prognostic interest (Chambers, A.F. et al. (1993) Crit Rev. Oncog. 
5 4:95-114). 

Metalloproteases have active sites that include two glutamic acid residues and one histidine 
residue that serve as binding sites for zinc. Carboxypeptidases A and B are the principal mammalian 
metalloproteases. Both are exoproteases of similar structure and active sites. Carboxypeptidase A, 
like chymotrypsin, prefers C-terminal aromatic and aliphatic side chains of hydrophobic nature, 
1 o whereas carboxypeptidase B is directed toward basic arginine and lysine residues. Glycoprotease 
(GCP), or O-sialoglycjoprotein endopeptidase, is a metallopeptidase which specifically cleaves 

O-sialoglycoproteins such as glycophorin A. Another metallopeptidase, placental leucine 

i 

aminopeptidase (P-LAP) degrades several peptide hormones such as oxytocin and vasopressin, 
suggesting a role in maintaining homeostasis during pregnancy, and is expressed in several tissues 

15 (Rogi, T. et al. (1996) J. Biol. Chem. 271:56-61). 

Ubiquitin proteases are associated with the ubiquitin conjugation system (UCS), a major 
pathway for the degradation of cellular proteins in eukaryotic cells and some bacteria; The UCS 
mediates the elimination of abnormal proteins and regulates the half-lives of important regulatory : 
proteins that control cellular processes such as gene transcription and cell cycle progression. In the * 

20 UCS pathway, proteins targeted for degradation are conjugated to a ubiquitin, a small heat stable 
protein. The ubiquitinated protein is then recognized and degraded by proteasome, a large, 
multisubunit proteolytic enzyme complex, and ubiquitin is released for reutilization by ubiquitin 
protease. The UCS is implicated in the degradation of mitotic cyclic kinases, oncoproteins, tumor 
suppressor genes such as p53, viral proteins, cell surface receptors associated with signal 

25 transduction, transcriptional regulators, and mutated or damaged proteins (Ciechanover, A. (1994) 
Cell 79: 1 3-21). A murine proto-oncogene, Unp, encodes a nuclear ubiquitin protease whose 
overexpression leads to oncogenic transformation of NIH3T3 cells, and the human homolog of this 
gene is consistently elevated in small cell tumors and adenocarcinomas of the lung (Gray, D.A. 
(1995) Oncogene 10:2179-2183). 

30 Signal Peptidases 

The mechanism for the translocation process into the endoplasmic reticulum (ER) involves 
the recognition of an N-terminal signal peptide on the elongating protein. The signal peptide directs 
the protein and attached ribosome to a receptor on the ER membrane. The polypeptide chain passes 
through a pore in the ER membrane into the lumen while the N-terminal signal peptide remains 

73 

BNSDOaD:<WO_0162927A2J_> ■** 



WO 01/62927 



PCT/US01/06059 



attached at the membrane surface. The process is completed when signal peptidase located inside the 
ER cleaves the signal peptide from the protein and releases the protein into the lumen. 
Protease Inhibitors 

Protease inhibitors and other regulators of protease activity control the activity and effects of 
5 proteases. Protease inhibitors have been shown to control pathogenesis in animal models of 
proteolytic disorders (Murphy, G. (1991) Agents Actions Suppl. 35:69-76). Low levels of the 
cystatins, low molecular weight inhibitors of the cysteine proteases, correlate with malignant 
progression of tumors. (Calkins, C. et al (1995) Biol. Biochem. Hoppe Seyler 376:71-80). , Serpins 
are inhibitors of mammalian plasma serine proteases. Many serpins serve to regulate the blood 

10 clotting cascade and/or the complement cascade in mammals. Sp32 is a positive regulator of the 
mammalian acrosomal protease, acrosin, that binds the proenzyme, proacrosin, and thereby aides in 
packaging the enzyme into the acrosomal matrix (Baba, T. et al. (1994) J. Biol. Chem. 269:10133- ' 
10140). The Kunitz family of serine protease inhibitors are characterized by one or more "Kunitz 
domains" containing a series of cysteine residues that are regularly spaced over approximately 50 

1 is amino acid residues and form three intrachain disulfide bonds. Members of this family include 

aprotinin, tissue factor pathway inhibitor (TFPI-1 and TFPI-2), inter-ot-trypsin inhibitor, and bikunin. 
(Marlor, C.W. et al. (1997) J. Biol. Chem. 272:12202-12208.) Members of this family are potent 
inhibitors (in the nanomolar range) against serine proteases such as kallikrein and plasmin. Aprotinin 
has clinical utility in reduction of perioperative blood loss. 

20 A major portion of all proteins synthesized in eukaryotic cells are synthesized on the 

cytosolic surface of the endoplasmic reticulum (ER). Before these immature proteins are distributed 
to other organelles in the cell or are secreted, they must be transported into the interior lumen of the 
ER where post-translational modifications are performed. These modifications include protein folding 
and the formation of disulfide bonds, and N-linked glycosylations. 

25 Protein Isomerases 

Protein folding in the ER is aided by two principal types of protein isomerases, protein 
disulfide isomerase (PDI), and peptidyl-prolyl isomerase (PPI). PDI catalyzes the oxidation of free 
sulfhydryl groups in cysteine residues to form intramolecular disulfide bonds in proteins. PPI, an 
enzyme that catalyzes the isomerization of certain proline imidic bonds in oligopeptides and proteins, 

30 is considered to govern one of the rate limiting steps in the folding of many proteins to their final 
functional conformation. The cyclophilins represent a major class of PPI that was originally 
identified as the major receptor for the immunosuppressive drug cyclosporin A (Handschumacher, 
R.E. et al. (1984) Science 226: 544-547). 
Protein Glvcosvlation 
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Hie glycosylation of most soluble secreted and membrane-bound proteins by 
oligosaccharides linked to asparagine residues in proteins is also performed in the ER. This reaction 
is catalyzed by a membrane-bound enzyme; oligpsaccharyl transferase. Although the exact purpose 
of this "N-linked" glycosylation- is unknown, the presence of oligosaccharides tends to make a 
5 glycoprotein resistant to protease digestion. In addition, oligosaccharides attached to cell-surface 
proteins called selectins are known to function in cell-cell adhesion processes (Alberts, B. et al. 
(1994) Molecular Biology of the Cell : Garland Publishing Co., New York NY, p.608). "O-linked" 
glycosylation of proteins also occurs in the ER by the addition of N-acetylgalactosamine to the 
hydroxyl group of a serine or threonine residue followed by the sequential addition of other sugar 

10 residues to the first. This process is catalysed by a series of glycosyltransf erases each specific for a 
particular donor sugar nucleotide and acceptor molecule (Lodish, H. et al. (1995) Molecular Cell 
Biology , W.H. Freeman and Co., New York NY, pp.700-708). In many cases, both N- and O-linked 
oligosaccharides appear to be required for the secretion of proteins or the movement of plasma 
membrane glycoproteins to the cell surface. 

15 An additional glycosylation mechanism operates in the ER specifically to target lysosomal t 

enzymes to lysosomes and prevent their secretion. Lysosomal enzymes in the ER receive an N-linked 
oligosaccharide, like plasma membrane and secreted proteins, but are then phosphorylated on one or 
two mannose residues. The phosphorylation of mannose residues occurs in two steps, the first step 
being the addition of an N-acetylglucosamine phosphate residue by N-acetylglucosamine 

20 phosphotransferase, and the second the removal of the N-acetylglucosamine group by 

phosphodiesterase. The phosphorylated mannose residue then targets the lysosomal enzyme to a 
mannose 6-phosphate receptor which transports it to a lysosome vesicle (Lodish, supra , pp. 708-71 1). 
Chaperones 

Molecular chaperones are proteins that aid in the proper folding of immature proteins and 
25 refolding of improperly folded ones, the assembly of protein subunits, and in the transport of 

unfolded proteins across membranes. Chaperones are also called heat-shock proteins (hsp) because of 
their tendency to be expressed in dramatically increased amounts following brief exposure of cells to 
elevated temperatures. This latter property most likely reflects their need in the refolding of proteins 
that have become denatured by the high temperatures. Chaperones may be divided into several 
3 o classes according to their location, function, and molecular weight, and include hsp60, TCP1 , hsp70, 
hsp40 (also called DnaJ), and hsp90. For example, hsp90 binds to steroid hormone receptors, 
represses transcription in the absence of the ligand, and provides proper folding of the ligand-binding 
domain of the receptor in the presence of the hormone (Burston, S.G. and A.R. Clarke (1995) Essays 
Biochem. 29:125-136). Hsp60 and hsp70 chaperones aid in the transport and folding of newly 
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synthesized proteins. Hsp70 acts early in protein folding, binding a newly synthesized protein before 
it leaves the ribosome and transporting the protein to the mitochondria or ER before releasing the 
folded protein. Hsp60, along with hsplO, binds misfolded proteins and gives them the opportunity to 
refold correctly. All chaperones share an affinity for hydrophobic patches on incompletely folded 
5 proteins and the ability to hydrolyze ATP. The energy of ATP hydrolysis is used to release the hsp- 
bound protein in its properly folded state (Alberts, supra , pp 214, 571-572). 

Nucleic Acid Synthesis and Modification Molecules 
Polymerases 

10 DNA and RNA replication are critical processes for cell replication and function. DNA and 

RNA replication are mediated by the enzymes DNA and RNA polymerase, respectively, by a 
. "templating" process in which the nucleotide sequence of a DNA or RNA strand is copied by 

complementary base-pairing into a complementary nucleic acid sequence of either DNA or RNA. 

i 

However, there are fundamentaTdifferences between the two processes. 

15 DNA polymerase catalyzes the stepwise addition of a deoxyribonucleotide to the 3'-OH end 

of a polynucleotide strand (the primer strand) that is paired to a second (template) strand. The new 
DNA strand therefore grows in the 5' to 3' direction (Alberts, B. et al. (1994) The Molecular Biology 
of the Cell . Garland Publishing Inc., New York NY, pp. 25 1-254). The substrates for the 
polymerization reaction are the corresponding deoxynucleotide triphosphates which must base-pair 

2 o with the correct nucleotide on the template strand in order to be recognized by the polymerase. 

Because DNA exists as a double-stranded helix, each of the two strands may serve as a template for 
the formation of a new complementary strand. Each of the two daughter cells of the dividing cell 
therefore inherits a new DNA double helix containing one old and one new strand. Thus, DNA is 
said to be replicated "semiconservatively" by DNA polymerase. In addition to the synthesis of new 

2 5 DNA, DNA polymerase is also involved in the repair of damaged DNA as discussed below under 
"Ligases." 

In contrast to DNA polymerase, RNA polymerase uses a DNA template strand to "transcribe" 
DNA into RNA using ribonucleotide triphosphates as substrates. Like DNA polymerization, RNA 
polymerization proceeds in a 5' to 3' direction by addition of a ribonucleoside monophosphate to the 
30 3'-OH end of a growing RNA chain. DNA transcription generates messenger RNAs (mRNA) that 
carry information for protein synthesis, as well as the transfer, ribosomal, and other RNAs that have 
structural or catalytic functions. In eukaryotes, three discrete RNA polymerases synthesize the three 
different types of RNA (Alberts, supra , pp. 367-368). RNA polymerase I makes the large ribosomal 
RNAs, RNA polymerase n makes the mRNAs that will be translated into proteins, and RNA 
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polymerase in makes a variety of small, stable RNAs, including 5S ribosomal RNA and the transfer 
RNAs (tRNA). In all cases, RNA synthesis is initiated by binding of the RNA polymerase to a 
promoter regiop on the DNA and synthesis begins at a start site within the promoter. Synthesis is 
completed at a broad, general stop or termination region in the DNA where both the polymerase and 
5 the completed RNA chain are released. 
Ligases 

DNA repair is the process by which accidental base changes, such as those produced by 
oxidative damage, hydrolytic attack, or uncontrolled methylation of DNA are corrected before 
replication or transcription of the DNA can occur. Because of the efficiency of the DNA repair 

10 process, fewer than one in one thousand accidental base changes causes a mutation (Alberts, supra, 
pp. 245-249). The three steps common to most types of DNA repair are (1) excision of the damaged 
or altered base or nucleotide by DNA nucleases, leaving a gap; (2) insertion of the correct nucleotide 
in this gap by DNA polymerase using the complementary strand as the template; and (3) sealing the 
break left between the inserted nucleotide(s) and the existing DNA strand by DNA ligase. In the last 

15 reaction, DNA ligase uses the energy from ATP hydrolysis to activate the 5* end of the broken * » 
phosphodiester bond before forming the new bond with the 3-OH of the DNA strand. In Bloom's 
syndrome, an inherited human disease, individuals are partially deficient in DNA ligation and 
consequently have an increased incidence of cancer (Alberts, supra , p. 247). 
Nucleases 

20 Nucleases comprise both enzymes that hydrolyze DNA (DNase) and RNA (RNase). They 

serve different purposes in nucleic acid metabolism. Nucleases hydrolyze the phosphodiester bonds 
between adjacent nucleotides either at internal positions (endonucleases) or at the terminal 3' or 5* 
nucleotide positions (exonucleases). A DNA exonuclease activity in DNA polymerase, for example, 
serves to remove improperly paired nucleotides attached to the 3*-OH end of the growing DNA strand 

25 by the polymerase and thereby serves a "proofreading" function. As mentioned above, DNA 
endonuclease activity is involved in the excision step of the DNA repair process. 

RNases also serve a variety of functions. For example, RNase P is a ribonucleoprotein 
enzyme which cleaves the 5' end of pre-tRNAs as part of their maturation process. RNase H digests 
the RNA strand of an RNA/DNA hybrid. Such hybrids occur in cells invaded by retroviruses, and 

3 o RNase His an important enzyme in the retroviral replication cycle. Pancreatic RNase secreted by the 
pancreas into the intestine hydrolyzes RNA present in ingested foods. RNase activity in serum and 
cell extracts is elevated in a variety of cancers and infectious diseases (Schein, C.H. <1997) Nat 
Biotechnol. 15:529-536). Regulation of RNase activity is being investigated as a means to control 
tumor angiogenesis, allergic reactions, viral infection and replication, and fungal infections. . 
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Methylation of specific nucleotides occurs in both DNA and RNA, and serves different 
functions in the two macromolecules. Methylation of cytosine residues to form 5-methyl cytosine in 
DNA occurs specifically at CG sequences which are base-paired with one another in the DNA double- 
5 helix. This pattern of methylation is passed from generation to generation during DNA replication by 
an enzyme called "maintenance methylase" that acts preferentially on those CG sequences that are 1 
1 base-paired with a CG sequence that is already methylated. Such methylation appears to distinguish 

active from inactive genes by preventing the binding of regulatory proteins that "turn on" the gene, 
but permit the binding of proteins that inactivate the gene (Alberts, supra , pp. 448-45 1). IhRNA 
10 metabolism, "tRNA methylase" produces one of several nucleotide modifications in tRNA that affect 
the conformation and base-pairing of the molecule and facilitate the recognition of the appropriate 
mRNA codons by specific tRNAs. Hie primary methylation pattern is the dimethylation of guanine 
residues to form N,N-dimethyl guanine. 
Helicases and Single-Stranded "Rinding Proteins 
'is Helicases are enzymes that destabilize and unwind double helix structures in both DNA and 

RNA. . Since DNA replication occurs more or less simultaneously on both strands, the two strands 
must first separate to generate a replication "fork" for DNA polymerase to act on. Two types of 
replication proteins contribute to this process, DNA helicases and single-stranded binding proteins. 

i 

DNA helicases hydrolyze ATP and use the energy of hydrolysis to separate the DNA strands. Single- 

2 0 stranded binding proteins (SSB s) then bind to the exposed DNA strands without covering the bases, 

thereby temporarily stabilizing them for templating by the DNA polymerase (Alberts, supra , pp. 255- 
256). 

RNA helicases also alter and regulate RNA conformation and secondary structure. Like the 
DNA helicases, RNA helicases utilize energy derived from ATP hydrolysis to destabilize and unwind 
25 RNA duplexes. The most well-characterized and ubiquitous f amily of RNA helicas esistheDEAD- 
box family, so named for the conserved B-type ATP-binding motif which is diagnostic of proteins in 
this family. Over 40 DEAD-box helicases have been identified in organisms as diverse as bacteria, 
insects, yeast, amphibians, mammals, and plants. DEAD-box helicases function in diverse processes 
such as translation initiation, splicing, ribosome assembly, and RNA editing, transport, and stability. 

3 0 Some DEAD-box helicases play tissue- and stage-specific roles in spermatogenesis and 

embryogenesis. Overexpression of the DEAD-box 1 protein (DDX1) may play a role in the 
progression of neuroblastoma (Nb) and retinoblastoma (Rb) tumors (Godbout, R. et al. (1998) J. Biol. 
Chem. 273:21 161-21 168). Hiese observations suggest that DDX1 may promote or enhance tumor 
progression by altering the normal secondary structure and expression levels of RNA in cancer cells. 
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Other DEAD-box helicases have been implicated either directly or indirectly in tumorigenesis 
(Discussed in Godbout, supra) . For example, murine p68 is mutated in ultraviolet light-induced 
tumors, and human DDX6 is located at a chromosomal breakpoint associated with B-cell lymphoma. 
Similarly, a chimeric protein comprised of DDX10 and NUP98, a nucleoporin protein, may be 
5 involved in the pathogenesis of certain myeloid malignancies. . 
Topoisomerases 

Besides the need to separate DNA strands prior to replication, the two strands must be 
"unwound" from one another prior to their separation by DNA helicases. This function is performed 
by proteins known as DNA topoisomerases. DNA topoisomerase effectively acts as a reversible 

10 nuclease that hydrolyzes a phosphodiesterase bond in a DNA strand, permitting the two strands to 
rotate freely about one another to remove the strain of the helix, and then rejoins the original 
phosphodiester bond between the two strands^ Two types of, DNA topoisomerase exist, types I and n. 
DNA Topoisomerase I causes a single-strand break in a DNA helix to allow the rotation of the two 
strands of the helix about the remaining phosphodiester bond in the opposite strand. DNA 

1 5 topoisomerase II causes a transient break in both strands of a DNA helix where two double helices 
cross over one another. This type of topoisomerase can efficiently separate two interlocked DNA 
circles (Alberts, supra, pp.260-262). Type II topoisomerases are largely confined to proliferating cells 
in eukaryotes, such as cancer cells. For this reason they are targets for anticancer drugs. 
Topoisomerase II has been implicated in multi-drug resistance (MDR) as it appears to aid in the repair 

20 of DNA damage inflicted by DNA binding agents such as doxorubicin and vincristine. 
Recombinases 

Genetic recombination is the process of reairanging DNA sequences within an organism's 
genome to provide genetic variation for the organism in response to changes in the environment 
DNA recombination allows variation in the particular combination of genes present in an individual's 

25 genome, as well as the timing and level of expression of these genes (see Alberts, supra , pp. 263- 

273). Two broad classes of genetic recombination are commonly recognized, general recombination 
and site-specific recombination. General recombination involves genetic exchange between any 
homologous pair of DNA sequences usually located on two copies of the same chromosome. The 
process is aided by enzymes called recombinases that "nick" one strand of a DNA duplex more or less 

3 o randomly and permit exchange with the complementary strand of another duplex. The process does 
not normally change the arrangement of genes on a chromosome. In site-specific recombination, the 
recombinase recognizes specific nucleotide sequences present in one or both of the recombining 
molecules. Base-pairing is not involved in this form of recombination and therefore does not require 
DNA homology between the recombining molecules. Unlike general recombination, this form of 
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recombination can alter the relative positions of nucleotide sequences in chromosomes. 
Splicing Factors ( 

, Various proteins are necessary for processing of transcribed RNAs in the nucleus. Pre- 
mRNA processing steps include capping at the 5' end with methylguanosine, polyadenylating the 3' 
5 end, and splicing to remove introns. The primary RNA transcript from DNA is a faithful copy of the 
gene containing both exon and intron sequences, and the latter sequences must be cut out of the RNA 
transcript to produce an mRNA that codes for a protein. This "splicing" of the mRNA sequence takes 
place in the nucleus with the aid of a large, multicomponent ribonucleoprotein complex known as a 
spliceosome. The spliceosomal complex is composed of five small nuclear ribonucleoprotein 

10 particles (snRNPs) designated Ul, U2, y4, U5, and U6, and a number of additional proteins. Each 
snRNP contains a single species of snRNA and about ten proteins. The RNA components of some 
snRNPs recognize and base pair ;with intron consensus sequences. The protein components mediate 
spliceosome assembly and the splicing reaction. Autoantibodies to snRNP proteins are found in the 
blood of patients with systemic lupus erythematosus (Stryer, L. (1995) Biochemistry. W.H. Freeman 

1 1 5 and Company, New York NY, p. 863). 

Adhesion Molecules 

The surface of a cell is rich in transmembrane proteoglycans, glycoproteins, glycolipids, and 
receptors. These macromolecules mediate adhesion with other cells and with components of the 
2 o extracellular matrix (ECM). The interaction of the cell with its surroundings profoundly influences 
cell shape, strength, flexibility, motility, and adhesion. These dynamic properties are intimately 
associated with signal transduction pathways controlling cell proliferation and differentiation, tissue 
construction, and embryonic development 
Cadherins 

i 

2 5 Cadherins comprise a family of calcium-dependent glycoproteins that function in mediating 

cell-cell adhesion in virtually all solid tissues of multicellular organisms. These proteins share 
multiple repeats of a cadherin-specific motif, and the repeats form the folding units of the cadherin 
extracellular domain. Cadherin molecules cooperate to form focal contacts, or adhesion plaques, 
between adjacent epithelial cells. The cadherin family includes the classical cadherins and 

3 o protocadherins. Classical cadherins include the E-cadherin, N-cadherin, and P-cadherin subfamilies. 

E-cadherin is present on many types of epithelial cells and is especially important for embryonic 
development N-cadherin is present on nerve, muscle, and lens cells and is also critical for embryonic 
development P-cadherin is present on cells of the placenta and epidermis. Recent studies report that 
protocadherins are involved in a variety of cell-cell interactions (Suzuki, S.T. (1996) J. Cell Sci. 
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109:2609-261 1). The intracellular anchorage of cadherins is regulated by their dynamic association 
with catenins, a family of cytoplasmic signal transduction proteins associated with the actin 
cytoskeleton. The anchorage of cadherins to the actin cytoskeleton appears to be regulated by protein 
tyrosine phosphorylation, and the cadherins are the target of phosphorylation-induced junctional 
5 disassembly (Aberle, H. et al. (1996) J. Cell. Biochem. 61:514-523). 
Integrins 

Integrins are ubiquitous transmembrane adhesion molecules that link the ECM to the internal 
cytoskeleton. Integrins are composed of two noncovalently associated transmembrane glycoprotein 
subunits called a and p. Integrins function as receptors that play a role in signal transduction. For 

i o example, binding of integrin to its extracellular ligand may stimulate changes in intracellular calcium 
levels or protein kinase activity (Sjaastad, M.D. and W.J. Nelson (1997) BioEssays 19:47-55). At 
least ten cell surface receptors of the integrin family recognize the ECM component fibronectin, 
which is involved in many different biological processes including cell migration and embryogenesis 

< (Johansson, S. et al. (1997) Front. Biosci. 2:D126-D146). 

15 Lectins 

Lectins comprise a ubiquitous family of extracellular glycoproteins which bind cell surface 
carbohydrates specifically and reversibly, resulting in the agglutination of cells (reviewed in 
Drickamer, K. and M.E. Taylor (1993) Annu. Rev. Cell Biol. 9:237-264). This function is 
particularly important for activation of the immune response. Lectins mediate the agglutination and 

20 mitogenic stimulation of lymphocytes at sites of inflammation (Lasky, L.A. (1991) J. Cell. Biochem. 
45:139-146; Paietta, E. et al. (1989) J. Immunol. 143:2850-2857). 

Lectins are further classified into subfamilies based on carbohydrate-binding specificity and 
other criteria. The galectin subfamily, in particular, includes lectins that bind p-galactoside 
carbohydrate moieties in a thiol-dependent manner (reviewed in Hadari, Y.R. et al. (1998) J. Biol. 

25 Chem. 270:3447-3453). Galectins are widely expressed and developmentally regulated. Because all 
galectins lack an N-terminal signal peptide, it is suggested that galectins are externalized through an 
atypical secretory mechanism. Two classes of galectins have been defined based on molecular weight 
and oligomerization properties. Small galectins form homodimers and are about 14 to 16 kilodaltons 
in mass, while large galectins are monomelic and about 29-37 kilodaltons. 

3 o Galectins contain a characteristic carbohydrate recognition domain (CRD). The CRD is 

about 140 amino acids and contains several stretches of about 1-10 amino acids which are highly 
conserved among all galectins. A particular 6-amino acid motif within the CRD contains conserved 
tryptophan and arginine residues which are critical for carbohydrate binding. The CRD of some 
galectins also contains cysteine residues which may be important for disulfide bond formation. 
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Secondary structure predictions indicate that the CRD forms several 0-sheets. 

Galectins play a number of roles in diseases and conditions associated with cell-cell and cell- 
matrix interactions. For example, certain galectins associate with sites of inflammation and bind to 
cell surface immunoglobulin E molecules. In addition, galectins may play an important role in cancer 
5 metastasis. Galectin overexpression is correlated with the metastatic potential of cancers in humans 
and mice. Moreover, anti-galectin antibodies inhibit processes associated with cell transformation, 
such as cell aggregation and anchorage-independent growth (See, for example, Su, Z.-Z. et al. (1996) 
Proc. NatL Acad. Sci. USA 93:7252-7257). 
Selectins 

10 Selectins, or LEC-CAMs, comprise a specialized lectin subfamily involved primarily in 

inflammation and leukocyte adhesion (Reviewed in Lasky, supra) . Selectins mediate the recruitment 
of leukocytes from th^ circulation to sites of acute inflammation and are expressed on the surface of 

vascular endothelial cells in response to cytokine signaling. Selectins bind to specific ligands on the 

• . 'i 

leukocyte cell membrane and enable the leukocyte to adhere to and migrate along the endothelial 
15 surface. Binding of selectin to its ligand leads to polarized rearrangement of the actin cytoskeleton 

and stimulates signal transduction within the leukocyte (Brenner, B. et al. (1997) Biochem. Biophys. 

Res. Commun. 231:802-807; Hidari, KJL et al. (1997) J. Biol. Chem. 272:28750-28756). Members 

of the selectin family possess three characteristic motifs: a lectin or carbohydrate recognition domain; 

an epidermal growthfactor-like domain; and a variable number of short consensus repeats (scr or 
20 "sushi" repeats) which are also present in complement regulatory proteins. The selectins include 

lymphocyte adhesion molecule- 1 (Lam-1 or L-selectin), endothelial leukocyte adhesion molecule- 1 

(ELAM-1 or E-selectin), and granule membrane protein-140 (GMP-140 or P-selectin) (Johnston, G J. 

et al. (1989) Cell 56:1033-1044). 

25 Antigen Recognition Molecules 

All vertebrates have developed sophisticated and complex immune systems that provide 
protection from viral, bacterial, fungal, and parasitic infections. A key feature of the immune system 
is its ability to distinguish foreign molecules, or antigens, from "self' molecules. This ability is 
mediated primarily by secreted and transmembrane proteins expressed by leukocytes (white blood 

3 o cells) such as lymphocytes, granulocytes, and monocytes. Most of these proteins belong to the 
immunoglobulin (Ig) superfamily, members of which contain one or more repeats of a conserved 
structural domain. This Ig domain is comprised of antiparallel p sheets joined by a disulfide bond in " 
an arrangement called the Ig fold. Members of the Ig superfamily include T-cell receptors, major 
histocompatibility (MHC) proteins, antibodies, and immune cell-specific surface markers such as 
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CD4, CD8, and CD28. 

MHC proteins are cell surface markers that bind to and present foreign antigens to T cells. 
MHC molecules are classified as either class I or class IL Class I MHC molecules (MHC I) are 
expressed on the surface of almost all cells and are involved in the presentation of antigen to 
5 cytotoxic T cells. For example, a cell infected with virus will degrade intracellular viral proteins and 
express the protein fragments bound to MHC I molecules on th$ cell surface. The MHC I/antigen 
complex is recognized by cytotoxic T-cells which destroy the; infected cell and the virus within. 
Class II MHC molecules are expressed primarily on specialized antigen-presenting cells of the 
immune system, such as B-cells and macrophages. These cells ingest foreign proteins from the 

1 o extracellular fluid and express MHC II/antigen complex on the cell surface. This complex activates 

helper T-cells, which then secrete cytokines and other factors that stimulate the immune response. 
MHC molecules also play an important role in organ rejection following transplantation. Rejection 
occurs when the recipient's T-cells respond to foreign MHC molecules on the transplanted organ in 
the same way as to self MHC molecules bound to foreign antigen. (Reviewed in Alberts, B. et al. 
15 (1994) Molecular Biology of the Cell. Garland Publishing, New York NY, pp. 1229-1246.) 

Antibodies, or immunoglobulins, are either expressed on the surface of B-cells or secreted by 
B-cells into the circulation. Antibodies bind and neutralize foreign antigens in the blood and other 
extracellular fluids. The prototypical antibody is a tetramer consisting of two identical heavy * 
polypeptide chains (H-chains) and two identical light polypeptide chains (L-chains) interlinked by 

2 o disulfide bonds. This arrangement confers the characteristic Y-shape to antibody molecules. 

Antibodies are classified based on their H-chain composition. The five antibody classes, IgA, IgD, 
IgE, IgG and IgM, are defined by the a, 8, €, y 9 and jx H-chain types. There are two types of L- 
chains, k and A, either of which may associate as a pair with any H-chain pair. IgG, the most 
common class of antibody found in the circulation, is tetrameric, while the other classes of antibodies 

25 are generally variants or multimers of this basic structure. 

H-chains and L-chains each contain an N-terminal variable region and a C-terminal constant 
region. Hie constant region consists of about 1 10 amino acids in L-chains and about 330 or 440 
amino acids in H-chains. The amino acid sequence of the constant region is nearly identical among 
H- or L-chains of a particular class. The variable region consists of about 110 amino acids in both H- 

30 and L-chains. However, the amino acid sequence of the variable region differs among H- or L-chains 
of a particular class. Within each H- or L-chain variable region are three hypervariable regions of 
extensive sequence diversity, each consisting of about 5 to 10 amino acids. In the antibody molecule, 
the H- and L-chain hypervariable regions come together to form the antigen recognition site. 
(Reviewed in Alberts, supra , pp. 1206-1213 and 1216-1217.) 
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Both H-chains and L-chains contain repeated Ig domains. For example, a typical H-chain 
contains four Ig domains, three of which occur within the constant region and one of which occurs 
within the variable region and contributes to the formation of the antigen recognition site. Likewise, 
a typical L-chain contains two Ig domains, one of which occurs within the constant region and one of 
5 which occurs within the variable region. 

The immune system is capable of recognizing and responding to any foreign molecule that 1 
enters the body. Therefore, the immune system must be armed with a full repertoire of antibodies 
against all potential antigens. Such antibody diversity is generated by somatic rearrangement of gene 
segments encoding variable and constant regions. These gene segments are joined together by site- 

1 o specific recombination which occurs between highly conserved DNA sequences that flank each gene 

segment Because there are hundreds of different gene segments, millions of unique genes can be 
generated combinatorially . In addition, imprecise joining of these segments and an unusually high 
rate of somatic mutation within these segments further contribute to the generation of a diverse 
antibody population. 

15 T-cell receptors are both structurally and functionally related to antibodies. (Reviewed in 

Alberts, supra , pp. 1228-1229.) T-cell receptors are cell surface proteins that bind foreign antigens and 
mediate diverse aspects of the immune response. A typical T-cell receptor is a heterodimer comprised 
of two disulfide-linked polypeptide chains called a and p. Each chain is about 280 amino acids in 
length and contains one variable region and one constant region Each variable or constant region folds 

20 into an Ig domain. The variable regions from the a and P chains come together in the heterodimer to 
form the antigen recognition site. T-cell receptor diversity is generated by somatic rearrangement of 
gene segments encoding the a and p chains. T-cell receptors recognize small peptide antigens that are 
expressed on the surface of antigen-presenting cells and pathogen-infected cells. These peptide antigens 

are presented on the cell surface in association with major histocompatibility proteins which provide the 

i 

2 5 proper context for antigen recognitioa 



Secreted and Extracellular Matrix Molecules 

Protein secretion is essential for cellular functioa Protein secretion is mediated by a signal 
peptide located at the amino terminus of the protein to be secreted. The signal peptide is comprised of 
3 o about ten to twenty hydrophobic amino acids which target the nascent protein from the ribosome to the 
endoplasmic reticulum (ER). Proteins targeted to the ER may either proceed through the secretory 
pathway or remain in any of the secretory organelles such as theER, Golgi apparatus, or lysosomes. 
Proteins that transit through the secretory pathway are either secreted into the extracellular space or 
retained in the plasma membrane. Secreted proteins are often synthesized as inactive precursors that 
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are activated by post-translational processing events during transit through the secretory pathway. 

i 

Such events include glycosylation, proteolysis, and removal of the signal peptide by a signal peptidase. 
Other events that may occur during protein transport include chaperone-dependent unfolding and 
folding of the nascent protein and interaction of the protein with a receptor or pore complex. Examples 
5 of secreted proteins with amino terminal signal peptides include receptors, extracellular matrix 

molecules, cytokines, hormones, growth and differentiation factors, neuropeptides, vasomediators, ion 
channels, transporters/pumps, and proteases. (Reviewed in Alberts, B. et al. (1994) Molecular Biology 
of The Cell . Garland Publishing, New York NY, pp. 557-560, 582-592.) 

The extracellular matrix (ECM) is a complex network of glycoproteins, polysaccharides, 

l o proteoglycans, and other macromolecules that are secreted from the cell into the extracellular space. 
The ECM remains in close association with the cell surface and provides a supportive meshwork that 
profoundly influences cell shape, motility, strength, flexibility, and adhesion. In fact, adhesion of a 
cell to its surrounding matrix is required for cell survival except in the case of metastatic tumor cells, 
which have overcome the need for cell-ECM anchorage. This phenomenon suggests that the ECM 

1 5 plays a critical role in the molecular mechanisms of growth control and metastasis, (Reviewed in 
Ruoslahti, E. (1996) Sci. Am 275:72-77.) Furthermore, the ECM determines the structure and 
physical properties of connective tissue and is particularly important for morphogenesis and other 
processes associated with embryonic development and pattern formatioa 

The collagens comprise a family of ECM proteins that provide structure to bone, teeth, skin, 

20 ligaments, tendons, cartilage, blood vessels, and basement membranes. Multiple collagen proteins have 
been identified. Three collagen molecules fold together in a triple helix stabilized by interchain disulfide 
bonds. Bundles of these triple helices then associate to form fibrils. Collagen primary structure 
consists of hundreds of (Gly-X- Y) repeats whore about a third of the X and Y residues are Pro. 
Glycines are crucial to helix formation as the bulkier amino acid sidechains cannot fold into the triple 

25 helical conformation. Because of these strict sequence requirements, mutations in collagen genes have 
severe consequences. Osteogenesis imperfecta patients have brittle bones that fracture easily; in severe 
cases patients die in utero or at birth. Ehlers-Danlos syndrome patients have hyperelastic skin, 
hypermobile joints, and susceptibility to aortic and intestinal rupture. Chondrodysplasia patients have 
short stature and ocular disorders. Alport syndrome patients have hematuria, sensorineural deafness, 

30 and eye lens deformation. (Isselbacher, K.J. et al. (1994) Harrison's Principles of Internal Medicine , 
McGraw-Hill, Inc., New York NY, pp. 2105-2117; and Creighton, T.E. (1984) Proteins. Structures 
and Molecular Principles . W.H. Freeman and Company, New York NY, pp. 191-197.) 

Elastin and related proteins confer elasticity to tissues such as skin, blood vessels, and lungs. 
Elastin is a highly hydrophobic protein of about 750 amino acids that is rich in proline and glycine 
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residues. Elastin molecules are highly cross-linked, forming an extensive extracellular network of fibers 
and sheets. Elastin libers are surrounded by a sheath of microfibrils which are composed of a number 
of glycoproteins, including fibrillin. Mutations in the gene encoding fibrillin are responsible for 
Marfan* s syndrome,, a genetic disorder characterized by defects in connective tissue. In severe cases, 
5 the aortas of afflicted individuals are prone to rupture. (Reviewed in Alberts, supra , pp. 984-986.) 

Fibronectin is a large ECM glycoprotein found in all vertebrates. Fibronectin exists as a dimer 
of two subunits, each containing about 2,500 amino acids. Each subunit folds into a rod-like structure 
containing multiple domains. Hie domains each contain multiple repeated modules, the most common 
of which is the type in fibronectin repeat The type III fibronectin repeat is about 90 amino acids in 
10 length and is also found in other ECM proteins and in some plasma membrane and cytoplasmic 

proteins. Furthermore, some type HI fibronectin repeats contain a characteristic tripeptide consisting of 

Arginine-Glycine-Aspartic acid (RGD). The RGD sequence is recognized by the integrin family of cefl 

i 

surface receptors and is also found in other ECM proteins. Disruption of both copies of the gene 
encoding fibronectin causes early embryonic lethality in mice. The mutant embryos display extensive 

1 5 morphological defects, including defects in the formation of the notochord, somites, heart, blood 
vessels, neural tube, and extraembryonic structures. (Reviewed in Alberts, supra, pp. 986-987;) 

Laminin is a major glycoprotein component of the basal lamina which underlies and supports 
epithelial cell sheets. Laminin is one of the first ECM proteins synthesized in the developing embryo. 
Laminin is an 850 Jdlodalton protein composed of three polypeptide chains joined in the shape of a 

2 o cross by disulfide bonds. Laminin is especially important for angiogenesis and in particular, for 
guiding the formation of capillaries. (Reviewed in Alberts, supra , pp. 990-99 1 .) 

There are many other types of proteinaceous ECM components, most of which can be 
classified as proteoglycans. Proteoglycans are composed of unbranched polysaccharide chains 
(gjycosaminoglycans) attached to protein cores. Common proteoglycans include aggrecan, betaglycan, 

2 5 decorin, perlecan, serglycin, and syndecan-1 . Some of these molecules not only provide mechanical 
support, but also bind to extracellular signaling molecules, such as fibroblast growth factor and 
transforming growth factor P, suggesting a role for proteoglycans in cell-cell communication and cell 
growth. (Reviewed in Alberts, supra , pp. 973-978.) Likewise, the glycoproteins tenascin-C and 
tenascin-R are expressed in developing and lesioned neural tissue and provide stimulatory and anti- 

30 adhesive (inhibitory) properties, respectively, for axonal growth. (Faissner, A. (1997) Cell Tissue Res. 
290:331-341.) 

Cytoskeletal Molecules 

The cytoskeleton is a cytoplasmic network of protein fibers that mediate cell shape, structure, 
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and movement The cytoskeleton supports the cell membrane and forms tracks along which 
organelles and other elements move in the cytosol. Hie cytoskeleton is a dynamic structure that 
allows cells to adopt various shapes and to carry out directed movements. Major cytoskeletal fibers 
include the microtubules, the microfilaments, and the intermediate filaments. Motor proteins, 
5 including myosin, dynein, and kinesin, drive movement of or along the fibers. The motor protein ■ 
dynamin drives the formation of membrane vesicles. Accessory or associated proteins modify the 
structure or activity of the fibers while cytoskeletal membrane anchors connect the fibers to the cell 
membrane. 
Tubulins 

1 o Microtubules, cytoskeletal fibers with a diameter of about 24 nm, have multiple roles in the 

cell. Bundles of microtubules form cilia and flagella, which are whip-like extensions of the cell 
membrane that are necessary for sweeping materials across an epithelium and for swimming of 
sperm, respectively. Marginal bands of microtubules in red blood cells and platelets are important for 
| these cells' pliability. Organelles, membrane vesicles, and proteins are transported in the cell along 
15 tracks of microtubules. For example, microtubules run through nerve ceil axons, allowing bi- 
directional transport of materials and membrane vesicles between the cell body and the nerve 
terminal. Failure to supply the nerve terminal with these vesicles blocks the transmission of neural 
signals. Microtubules are also critical to chromosomal movement during cell division. Both stable' 
and short-lived populations of microtubules exist in the cell. 

2 o Microtubules are polymers of GTP-binding tubulin protein subunits. Each subunit is a 

heterodimer of a- and p- tubulin, multiple isoforms of which exist The hydrolysis of GTP is linked 
to the addition of tubulin subunits at the end of a microtubule. The subunits interact head to tail to 
form protofilaments; the protofilaments interact side to side to form a microtubule. A microtubule is 
polarized, one end ringed with oc-tubulin and the other with p-tubulin, and the two ends differ in their 
25 rates of assembly. Generally, each microtubule is composed of 13 protofilaments although 1 1 or 15 
protofilament-microtubules are sometimes found. Cilia and flagella contain doublet microtubules. 
Microtubules grow from specialized structures known as centrosomes or micro tubule-org anizing 
centers (MTOCs). MTOCs may contain one or two centrioles, which are pinwheel arrays of triplet 
microtubules. The basal body, the organizing center located at the base of a cilium or flagellum, 

3 o contains one centriole. Gamma tubulin present in the MTOC is important for nucleating the 

polymerization of a- and p- tubulin heterodimers but does not polymerize into microtubules. 
Microtubule- Associated Proteins 

Microtubule-associated proteins (MAPs) have roles in the assembly and stabilization of 
microtubules. One major family of MAPs, assembly MAPs, can be identified in neurons as well as 

87 



BNSDOCID: <WO_0162927A2J_> 



WO 01/62927 




PCT7US01/06059 



non-neuronal cells. Assembly MAPs are responsible for cross-linking microtubules in the cytosol 
These MAPs are organized into two domains: a basic microtubule-binding domain and an acidic 
projection domain. The projection domain is the binding site for membranes, intermediate filaments, or 
other microtubules, Based on sequence analysis, assembly MAPs can be further grouped into two 
5 types: Type I and Type EL Type I MAPs, which include MAPI A and MAP1B, are large, filamentous 
molecules that co-purify with microtubules and are abundantly expressed in brain and testes. Type I 
MAPs contain several repeats of a positively-charged amino acid sequence motif that binds, and 
neutralizes negatively charged tubulin, leading to stabilization of microtubules. MAPI A and MAP1B 
are each derived from a single precursor polypeptide that is subsequently proteolytically processed to 

10 generate one heavy chain and one light chain. 

Another light chain, LC3, is a 16.4 kDa molecule that binds MAPI A, MAP1B, and 
microtubules. It is suggested that LC3 is synthesized from a source other than the MAPI A or MAP1B 
transcripts, and that the expression of LC3 may be important in regulating the microtubule binding 
activity of MAPI A and MAP1B during cell proliferation (Mann, S.S. et al. (1994) J. Biol. Chem. 

15 269:11492-11497). 

Type II MAPs, which include MAP2a, MAP2b, MAP2c, MAP4, and Tau, are characterized 
by three to four copies of an 1 8-residue sequence in the miaotubule-binding domain. MAP2a, MAP2b, 
and MAP2c are found only in dendrites, MAP4 is found in non-neuronal cells, and Tau is found in 
axons and dendrites 6f nerve cells. Alternative splicing of the Tau mRNA leads to the existence of 

2 o multiple forms of Tau protein. Tau phosphorylation is altered in neurodegenerative disorders such as 
Alzheimer's disease, Pick's disease, progressive supranuclear palsy, corticobasal degeneration, and 
familial frontotemporal dementia and Parkinsonism linked to chromosome 17. The altered Tau 
phosphorylation leads to a collapse of the microtubule network and the formation of intraneuronal 
Tau aggregates (Spillantini, M.G. and M. Goedert (1998) Trends Neurosci. 21:428-433). 

2 5 The protein pericentrin is found in the MTOC and has a role in microtubule assembly. 

Actins 

Microfilaments, cytoskeletal filaments with a diameter of about 7-9 nm, are vital to cell 
locomotion, cell shape, cell adhesion, cell division, and muscle contraction. Assembly and 
disassembly of the microfilaments allow cells to change their morphology. Microfilaments are the 

3 o polymerized form of actin, the most abundant intracellular protein in the eukaryotic cell. Human cells 

contain six isoforms of actin. The three a-actins are found in different kinds of muscle, nonmuscle P- 
actin and nonmuscle y-actin are found in nonmuscle cells, and another y -actin is found in intestinal 
smooth muscle cells. G-actin, the monomelic form of actin, polymerizes into polarized, helical F- 
actin filaments, accompanied by the hydrolysis of ATP to ADP. Actin filaments associate to form 

88 



BNSDOCID: <WO 0162927A2J_> 



WO 01/62927 PCT/USO 1/06059 

bundles and networks, providing a framework to support the plasma membrane and determine cell 
shape. These bundles and networks are connected to the cell membrane. In muscle cells, thin 
filaments containing actin slide past thick filaments containing the motor protein myosin during 
contraction. A family of actin-related proteins exist that are not part of the actin cytoskeleton, but 
5 rather associate with microtubules and dynein. 
Actin-Associated Proteins 

Actin-associated proteins have roles in cross-linking, severing, and stabilization of actin 
filaments and in sequestering actin monomers. Several of the actin-associated proteins have multiple 
functions. Bundles and networks of actin filaments are held together by actin cross-linking proteins. 

10 These proteins have two actin-binding sites, one for each filament Short cross-linking proteins 
promote bundle formation while longer, more flexible cross-linking proteins promote network 
formation. Calmodulin-like calcium-binding domains in actfn cross-linking proteins allow calcium 
regulation of cross-linking. Group I cross-linking proteins have unique actin-binding domains and 
include the 30 kD protein, EF-la, fascin, and scruin. Group II cross-linking proteins have a 7,000- 

15 MW actin-binding domain and include villin and dematin. Group in cross-linking proteins have 

pairs of a 26,000-MW actin-binding domain and include finabrin, spectrin, dystrophin, ABP 120, and 
filamin. 

Severing proteins regulate the length of actin filaments by breaking them into short pieces or 
by blocking their ends. Severing proteins include gCAP39, severin (fragmin), gelsolin, and villin. 

2 o Capping proteins can cap the ends of actin filaments, but cannot break filaments. Capping proteins 
include CapZ and tropomodulin. The proteins thymosin and profilin sequester actin monomers in the 
cytosol, allowing a pool of unpolymerized actin to exist The actin-associated proteins tropomyosin, 
troponin, and caldesmon regulate muscle contraction in response to calcium. 
Intermediate Filaments and Associated Proteins 

25 Intermediate filaments (IFs) are cytoskeletal fibers with a diameter of about 10 nm, 

intermediate between that of microfilaments and microtubules. IFs serve structural roles in the cell, 
reinforcing cells and organizing cells into tissues. IFs are particularly abundant in epidermal cells and 
in neurons. IFs are extremely stable, and, in contrast to microfilaments and microtubules, do not 
function in cell motility. 

3 o Five types of IF proteins are known in mammals. Type I and Type II proteins are the acidic 

and basic keratins, respectively. Heterodimers of the acidic and basic keratins are the building blocks 
of keratin IFs. Keratins are abundant in soft epithelia such as skin and cornea, hardepithelia such as 
nails and hair, and in epithelia that line internal body cavities. Mutations in keratin genes lead to 
epithelial diseases including epidermolysis bullosa simplex, bullous congenital ichthyosiform 



89 



WO 01/62927 




PCT/US01/06059 



erythroderma (epidermolytic hyperkeratosis), non-epidermolytic and epidennolytic palmoplanar 
keratoderma, ichthyosis bullosa of Siemens, pachyonychia congenita, and white sponge nevus. Some of 
these diseases result in severe skin blistering. (See> e.g., Wawersik, M. et al. (1997) J. Biol. Chem. 
272:32557-32565; and CordenL.D. and W.H. McLean (1996) Exp. Dermatol. 5:297-307.) 
5 Type III IF proteins include desmin, glial fibrillary acidic protein, vimentin, and peripheral. 

Desmin filaments in muscle cells link myofibrils into bundles and stabilize sarcomeres in contracting- 
muscle. Glial fibrillary acidic protein filaments are found in the glial cells that surround neurons and 
astrocytes. Vimentin filaments are found in blood vessel endothelial cells, some epithelial cells, and 
mesenchymal cells such as fibroblasts, and are commonly associated with microtubules. Vimentin 

10 filaments may have roles in keeping the nucleus and other organelles in place in the cell. Type IV IFs 
include the neurofilaments and nestin. Neurofilaments, composed of three polypeptides NF-L, NF-M, 
and NF-H, are frequently associated with microtubules in axons. Neurofilaments are responsible for 
the radial growth and diameter of an axon, and ultimately for the speed of nerve impulse transmission. 
Changes in phosphorylation and metabolism of neurofilaments are observed in neurodegenerative 

'15 diseases including amyotrophic lateral sclerosis, Parkinson's disease, and Alzheimer's disease (Julien, 
J.P. and W.E. Mushynski (1998) Prog. Nucleic Acid Res. Mol. Biol. 61:1-23). Type V IFs, the lamins, 
are found in the nucleus where they support the nuclear membrane. 

IFs have a central a-helical rod region interrupted by short nonhelical linker segments. The rod 
region is bracketed, in most cases, by non-helical head and tail domains. The rod regions of 

2 o intermediate filament proteins associate to form a coiled-coil dimer. A highly ordered assembly process 
leads from the dimers to the IFs. Neither ATP nor GTP is needed for IF assembly, unlike that of 
microfilaments and microtubules. 

EF-associated proteins (IFAPs) mediate the interactions of IFs with one another and with other 
cell structures. IFAPs cross-link IFs into a bundle, into a network, or to the plasma membrane, and 

2 5 may cross-link IFs to the microfilament and microtubule cytoskeleton. Microtubules and IFs are in 
particular closely associated. IFAPs include BPAG1 , plakoglobin, desmoplakin I, desmoplakin n, 
plectin, ankyrin, filaggrin, and lamin B receptor. 
Cvtoskeletal-Membrane Anchors 

Cytoskeletal fibers are attached to the plasma membrane by specific proteins. These 

30 attachments are important for maintaining cell shape and for muscle contraction. In erythrocytes, the 
spectrin-actin cytoskeleton is attached to cell membrane by three proteins, band 4.1, ankyrin, and 
adducin. Defects in this attachment result in abnormally shaped cells which are more rapidly 
degraded by the spleen, leading to anemia. In platelets, the spectrin-actin cytoskeleton is also linked 
to the membrane by ankyrin; a second actin network is anchored to the membrane by filamin. In 
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muscle cells the protein dystrophin links actin filaments to the plasma membrane; mutations in the 
dystrophin gene lead to Duchenne muscular dystrophy. In adherens junctions and adhesion plaques 
the peripheral membrane proteins cc-actinin and vinculin attach actin filaments to the cell membrane. 

DFs are also attached to membranes by cytoskeletal-membrane anchors. Hie nuclear lamina is 
5 attached to the inner surface of the nuclear membrane by the lamin B receptor. Vimentin IFs are • 
attached to the plasma membrane by ankyrin and plectin. Desmosome and hemidesmosome 
membrane junctions hold together epithelial cells of organs and skin. These membrane junctions 
allow shear forces to be distributed across the entire epithelial cell layer, thus providing strength and 
rigidity to the epithelium. IFs in epithelial cells are attached to the desmosome by plakoglobin and 

1 o desmoplakins. The proteins that link IFs to hemidesmosomes are not known. Desmin IFs surround 

the sarcomere in muscle and are linked to the plasma membrane by paranemin, synemin, and ankyrin. 
Mvosin-related Motor Proteins 

Myosins are actin-activated ATPases, found in eukaryotic cells, that couple hydrolysis of 
' ATP with motion. Myosin provides the motor function for muscle contraction and intracellular 
is movements such as phagocytosis and rearrangement of cell contents during mitotic cell division t 
(cytokinesis). The contractile unit of skeletal muscle, termed the sarcomere, consists of highly ordered 
arrays of thin actin-containing filaments and thick myosin-containing filaments. Crossbridges form 
between the thick and thin filaments, and the ATP-dependent movement of myosin heads within the - - 
thick filaments pulls the thin filaments, shortening the sarcomere and thus the muscle fiber. 
20 Myosins are composed of one or two heavy chains and associated light chains. Myosin 

heavy chains contain an amino-terminal motor or head domain, a neck that is the site of light-chain 
binding, and a carboxy-terminal tail domain. The tail domains may associate to form an oc-helical 
coiled coil. Conventional myosins, such as those found in muscle tissue, are composed of two 
myosin heavy-chain subunits, each associated with two light-chain subunits that bind at the neck 

2 5 region and play a regulatory role. Unconventional myosins, believed to function in intracellular 

motion, may contain either one or two heavy chains and associated light chains. There is evidence for 
about 25 myosin heavy chain genes in vertebrates, more than half of them unconventional, 
Dvnein-related Motor Proteins 

Dyneins are (-) end-directed motor proteins which act on microtubules. Two classes of 

3 o dyneins, cytosolic and axonemal, have been identified. Cytosolic diyneins are responsible for 

translocation of materials along cytoplasmic microtubules, for example, transport from the nerve 
terminal to the cell body and transport of endocytic vesicles to lysosomes. Cytoplasmic dyneins are 
also reported to play a role in mitosis. Axonemal dyneins are responsible for the beating of flagella and 
cilia. Dynein on one microtubule doublet walks along the adjacent microtubule doublet This sliding 
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force produces bending forces that cause the flagellum or cilium to heat Dyneins have a native mass 
between 1000 and 2000 kDa and contain either two or three forc&producing heads driven by the 
hydrolysis of ATP. The heads are linked via stalks to a basal domain which is composed of a highly 
variable number of accessory intermediate and light chains. 
5 Kinesin-related Motor Proteins 

Kinesins are (+) end-directed motor proteins which act on microtubules. The prototypical 
kinesin molecule is involved in the transport of membrane-bound .vesicles and organelles. This function 
is particularly important for axonal transport in neurons. Kinesin is also important in all cell types for 
the transport of vesicles from the Golgi complex to the endoplasmic reticulum. This role is critical for 
10 maintaining the identity and functionality of these secretory organdies. 

Kinesins define a ubiquitous, conserved family of over 50 proteins that can be classified into at 
least 8 subfamilies ba^ed on primary amino acid sequence, domain structure, velocity of movement, and 

cellular function. (Reviewed in Moore, J.D. and S.A. Endow (1996) Bioessays 18:207-219; and Hoyt, 

1 f 

AM. (1994) Curr. Opin. Cell Biol. 6:63-68.) The prototypical kinesin molecule is a heterotetramer 
15 comprised of two heavy polypeptide chains (KHCs) and two light polypeptide chains (KLCs). The 
KHC subunits are typically referred to as "kinesin." KHC is about 1000 amino acids in length, and 
KLC is about 550 amino acids in length. Two KHCs dimerize to form a rod-shaped molecule with 
three distinct regions of secondary structure. At one end of the molecule is a globular motor domain 
that functions in ATP hydrolysis and microtubule binding. Kinesin motor domains are highly conserved 
20 and share over 70% identity. Beyond the motor domain is an a-helical coiled-coil region which 
mediates dimerization. At the other end of the molecule is a fan-shaped tail that associates with 
molecular cargo. The tail is formed by the interaction of the KHC C -termini with the two KLCs. 

Members of the more divergent subfamilies of kinesins are called kinesin-related proteins 
(KRPs), many of which function during mitosis in eukaryotes (Hoyt supra) . Some KRPs are required 

2 5 for assembly of the mitotic spindle. In vivo and in vitro analyses suggest that these KRPs exert force 

on microtubules that comprise the mitotic spindle, resulting in the separation of spindle poles. 
Phosphorylation of KRP is required for this activity. Failure to assemble the mitotic spindle results in 
abortive mitosis and chromosomal aneuploidy, the latter condition being characteristic of cancer cells. 
In addition, a unique KRP, centromere protein E, localizes to the Mnetochore of human mitotic 

3 o chromosomes and may play a role in their segregation to opposite spindle poles. 

Dvnamin-related Motor Proteins 

Dynamin is a large GTPase motor protein that functions as a "molecular pinchase," 
generating a mechanochemical force used to sever membranes. This activity is important in forming 
clathrin-coated vesicles from coated pits in endocytosis and in the biogenesis of synaptic vesicles in 
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neurons. Binding of dynamin to a membrane leads to dynamin' s self-assembly into spirals that may 
act to constrict a flat membrane surface into a tubule. GTP hydrolysis induces a change in 
conformation of the dynamin polymer that pinches the membrane tubule, leading to severing of the 
membrane tubule and formation of a membrane vesicle. Release of GDP and inorganic phosphate 
5 leads to dynamin disassembly. Following disassembly the dynamin may either dissociate from the 
membrane or remain associated to the vesicle and be transported to, another region of the cell. Three 
homologous dynamin genes have been discovered, in addition to several dynamin-related proteins. 
Conserved dynamin regions are the N-terminal GTP-binding domain, a central pleckstrin homology 
domain that binds membranes, a central coiled-coil region that may activate dynamin' s GTPas6 

10 activity, and a C-terminal proline-rich domain that contains several motifs that bind SH3 domains on 
other proteins. Some dynamin-related proteins do not contain the pleckstrin homology domain or the 
proline-rich domain. (See McNiven, MA. (1998) Cell 94:i51-154; Scaife, R.M. and R.L. Margolis 
(1997) Cell. Signal. 9:395-401.) 

Hie cytoskeleton is reviewed in Lodish, H. et al. (1995) Molecular Cell Biology . Scientific 

15 American Books, New York NY. * 

Ribosomal Molecules 

Ribosomal RNAs (rRNAs) are assembled, along with ribosomal proteins, into ribosomes,* 
which are cytoplasmic particles that translate messenger RNA into polypeptides. The eukaryotic • 

20 ribosome is composed of a 60S (large) subunit and a 40S (small) subunit, which together form the 
80S ribosome. In addition to the 18S, 28S, 5S, and 5.8S rRNAs, the ribosome also contains more 
than fifty proteins. The ribosomal proteins have a prefix which denotes the subunit to which they 
belong, either L (large) or S (small). Ribosomal protein activities include binding rRNA and 
organizing the conformation of the junctions between rRNA helices (Woodson, S.A and N.B. 

25 Leontis (1998) Curr. Opin. Struct Biol. 8:294-300; Ramakrishnan, V. and S.W. White (1998) Trends 
Biochem. Sci. 23:208-212.) Three important sites are identified on the ribosome. The aminoacyl- 
tRNA site (A site) is where charged tRNAs (with the exception of the initiator-tRNA) bind on arrival 
at the ribosome. The peptidyl-tRNA site (P site) is where new peptide bonds are formed, as well as 
where the initiator tRNA binds. The exit site (E site) is where deacylated tRNAs bind prior to their 

3 o release from the ribosome. (Hie ribosome is reviewed in Stryer, L. (1995) Biochemistry W.H. 
Freeman and Company, New York NY, pp. 888-908; and Lodish, H. et al. (1995) Molecular Cell 
Biology Scientific American Books, New York NY. pp. 1 19-138.) 

Chromatin Molecules 
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The nuclear DNA of eukaryotes is organized into chromatin. Two types of chromatin are 
observed: euchromatin, some of which may be transcribed, and heterochromatin so densely packed that 
much of it is inaccessible to transcription. Chromatin packing thus serves to regulate protein 
expression in eukaryotes. Bacteria lack chromatin and the chromatin-packing level of gene regulatioa 
5 The fundamental unit of chromatin is the nucleosome of 200 DNA base pairs associated with 

two copies each of histones H2A, H2B, H3, and H4. Adjascent nucleosomes are linked by another 1 
class of histones, HI . Low molecular weight non-histone proteins called the high mobility group 
(HMG), associated with chromatin, may function in the unwinding of DNA and stabilization of single- 
stranded DNA Chromodomain proteins function in compaction of chromatin into its transcriptionally 

10. silent heterochromatin form. 

During mitosis, all DNA is compacted into heterochromatin and transcription ceases. 
Transcription in interphase begins, with the activation of a region of chromatin. Active chromatin is 
decondensed. Decondensation appears to be accompanied by changes in binding coefficient, 
phosphorylation and acetylation states of chromatin histones, HMG'proteins HMG13 and HMG17 

15 selectively bind activated chromatin. Topoisomerases remove superhelical tension on DNA. The 
activated region decondenses, allowing gene regulatory proteins and transcription factors to assemble 
ontheDNA. 

Patterns of chromatin structure can be stably inherited, producing heritable patterns of gene 
expression. In mammals, one of the two X chromosomes in each female cell is inactivated by 
2 o condensation to heterochromatin during zygote development. The inactive state of this chromosome is 
inherited, so that adult females are mosaics of clusters of paternal-X and maternal-X clonal cell groups. 
The condensed X chromosome is reactivated in meiosis. 

Chromatin is associated with disorders of protein expression such as thalassemia, a genetic 
anemia resulting from the removal of the locus control region (LCR) required for decondensation of the 

2 5 globin gene locus. 

For a review of chromatin structure and function see Alberts, B. et al. (1994) Molecular Cell 
Biology, third edition, Garland Publishing, Inc., New York NY, pp. 351-354, 433-439. 

Electron Transfer Associated Molecules 

3 o Electron carriers such as cytochromes accept electrons from NADH or FADHj and donate 

them to other electron carriers. Most electron-transferring proteins, except ubiquinone, are prosthetic 
groups such as flavins, heme, FeS clusters, and copper, bound to inner membrane proteins. 
Adrenodoxin, for example, is an FeS protein that forms a complex with NADPH:adrenodoxin 
reductase and cytochrome p450. Cytochromes contain a heme prosthetic group, a porphyrin ring 
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containing a tightly bound iron atom. Electron transfer reactions play a crucial role in cellular energy 
production. 

Energy is produced by the oxidation of glucose and fatty acids. Glucose is initially converted 
to pyruvate in the cytoplasm. Fatty acids and pyruvate are transported to the mitochondria for 
5 complete oxidation to C0 2 coupled by enzymes to the transport of electrons from NADH and FADHa 
to oxygen and to the synthesis of ATP (oxidative phosphorylation) from ADP and P 4 . 

Pyruvate is transported into the mitochondria and converted to acetyl-CoA for oxidation via 
the citric acid cycle, involving pyruvate dehydrogenase components, dihydrolipoyl transacetylase, and 
dihydrolipoyl dehydrogenase. Enzymes involved in the citric acid cycle include: citrate synthetase, 
10 aconitases, isocitrate dehydrogenase, alpha-ketoglutarate dehydrogenase complex including 
transsuccinylases, succinyl CoA synthetase, succinate dehydrogenase, fumarases, and malate 
dehydrogenase. Acetyl CoA is oxidized to C0 2 with concoipitant formation of NADH, FADH 2 , and 
GTP. In oxidative phosphorylation, the transfer of electrons from NADH and FADH 2 to oxygen by 
dehydrogenases is coupled to the synthesis of ATP from ADP and P { by the F^ ATPase complex in 
15 the mitochondrial inner membrane. Enzyme complexes responsible for electron transported ATP 
synthesis include the F^ ATPase complex, ubiquinone(CoQ)-cytochrome c reductase, ubiquinone 
reductase, cytochrome b, cytochrome FeS protein, and cytochrome c oxidase. 

ATP synthesis requires membrane transport enzymes including the phosphate transporter and 
the ATP-ADP antiport protein. The ATP-binding casette (ABC) superfamily has also been suggested 
20 as belonging to the mitochondrial transport group (Hogue, DL. et al. (1999) J. Mol. Biol. 285:379- 
389). Brown fat uncoupling protein dissipates oxidative energy as heat, and may be involved the fever 
response to infection and trauma (Cannon, B. et al. (1998) Ann. NY Acad. Sci. 856:171-187). 

Mitochondria are oval-shaped organelles comprising an outer membrane, a tightly folded 
inner membrane, an intermembrane space between the outer and inner membranes, and a matrix 

2 5 inside the inner membrane. The outer membrane contains many porin molecules that allow ions and 

charged molecules to enter the intermembrane space, while the inner membrane contains a variety of 
transport proteins that transfer only selected molecules. Mitochondria are the primary sites of energy 
production in cells. 

Mitochondria contain a small amount of DNA. Human mitochondrial DNA encodes 13 

3 o proteins, 22 tRNAs, and 2 rRNAs. Mitochondrial-DNA encoded proteins include NADH-Q 

reductase, a cytochrome reductase subunit, cytochrome oxidase subunits, and ATP synthase subunits. 

Electron-transfer reactions also occur outside the mitochondria in locations such as the 
endoplasmic reticulum, which plays a crucial role in lipid and protein biosynthesis. Cytochrome b5 
is a central electron donor for various reductive reactions occurring on the cytoplasmic surface of liver 
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endoplasmic reticulum. Cytochrome b5 has been found in Golgi, plasma, endoplasmic reticulum 
(ER), and microbody membranes. 

For a review of mitochondrial metabolism and regulation,' see Lodish, H. et al. (1995) 
Molecular Cell Biology. Scientific American Books, New York NY, pp. 745-797 and Stryer (1995) 
Biochemistry. W.H. Freeman and Co., San Francisco CA, pp 529-558, 988-989. 

The majority of mitochondrial proteins are encoded by nuclear genes, are synthesized on 1 
cytosolic ribosomes, and are imported into the mitochondria. Nuclear-encoded proteins which are 
destined for the mitochondrial matrix typically contain positively-charged amino terminal signal 
sequences. Import of these preproteins from the cytoplasm requires a multisubunit protein complex 
in the outer membrane known as the translocase of outer mitochondrial membrane (TOM; previously 
designated MOM; Pfanner, N. et al. (1996) Trends Biochem. Sci. 21:51-52) and at least three inner 
membrane proteins which comprise the translocase of inner mitochondrial membrane (TIM; 
previously designated MIM; Pfanner, supra) . An inside-negative membrane potential across the inner 
mitochondrial membrane is also required for preprotein import Preproteins are recognized by surface 
receptor components of the TOM complex and are translocated through a proteinaceous pore formed 
by other TOM components. Proteins targeted to the matrix are then recognized by the import 
machinery of the TIM complex. The import systems of the outer and inner membranes can function 
independently (Segui-Real, B. et al. (1993) EMBO J. 12:2211-2218). 

Once precursor proteins are in the mitochondria, the leader peptide is cleaved by a signal 
peptidase to generate the mature protein. Most leader peptides are removed in a one step process by a 
protease termed mitochondrial processing peptidase (MPP) (Paces, V. et al. (1993) Proc. Natl. Acad. 
Sci. USA 90:5355-5358). In some cases a two-step process occurs in which MPP generates an 
intermediate precursor form which is cleaved by a second enzyme, mitochondrial intermediate 
peptidase, to generate the mature protein. 

Mitochondrial dysfunction leads to impaired calcium buffering, generation of free radicals 
that may participate in deleterious intracellular and extracellular processes, changes in mitochondrial 
permeability and oxidative damage which is observed in several neurodegenerative diseases. 
Neurodegenerative diseases linked to mitochondrial dysfunction include some forms of Alzheimer's 
disease, Friedreich's ataxia, familial amyotrophic lateral sclerosis, and Huntington's disease (Beal, 
M.F. (1998) Biochim. Biophys. Acta 1366:211-213). The myocardium is heavily dependent on 
oxidative metabolism, so mitochondrial dysfunction often leads to heart disease (DiMauro, S. and M. 
Hirano (1998) Curr. Opin. Cardiol 13:190-197). Mitochondria are implicated in disorders of cell 
proliferation, since they play an important role in a cell's decision to proliferate or self-destruct 
through apoptosis. Hie oncoprotein Bcl-2, for example, promotes cell proliferation by stabilizing 
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mitochondrial membranes so that apoptosis signals are not'released (Susin, S.A. (1998) Biochim. 
Biophys. Acta 1366:151-165). 

Transcription Factor Molecules 

5 Multicellular organisms are comprised of diverse cell types that differ dramatically both in , 

structure and function. The identity of a cell is determined by its characteristic pattern of gene 
expression, and different cell types express overlapping but distinctive sets of genes throughout 
development Spatial and temporal regulation of gene expression is critical for the control of cell 
proliferation, cell differentiation, apoptosis, and other processes that contribute to organismal 
io development Furthermore, gene expression is regulated in response to extracellular signals that 
mediate cell-cell communication and coordinate the activities of different cell types. Appropriate 
gene regulation also ensures that cells function efficiently by expressing only those genes whose 
functions are required at a given time. 

1 Transcriptional regulatory proteins are essential for the control of gene expression. Some of 
1 5 these proteins function as transcription factors that initiate, activate, repress, or terminate gene 

transcription. Transcription factors generally bind to the promoter, enhancer, and upstream regulatory 
regions of a gene in a sequence-specific manner, although some factors bind regulatory elements 
within or downstream of a gene's coding region. Transcription factors may bind to a specific region 
of DNA singly or as a complex with other accessory factors. (Reviewed in Lewin, B. (1990) Genes 
20 IV, Oxford University Press, New York NY, and Cell Press, Cambridge MA, pp. 554-570.) 

The double helix structure and repeated sequences of DNA create topological and chemical 
features which can be recognized by transcription factors. These features are hydrogen bond donor 
and acceptor groups, hydrophobic patches, major and minor grooves, and regular, repeated stretches 
of sequence which induce distinct bends in the helix. Typically, transcription factors recognize • 

2 5 specific DNA sequence motifs of about 20 nucleotides in length. Multiple, adjacent transcription 

factor-binding motifs may be required for gene regulation. 

Many transcription factors incorporate DNA-binding structural motifs which comprise either 
a helices or 8 sheets that bind to the major groove of DNA Four well-characterized structural motifs 
are helix-turn-helix, zinc finger, leucine zipper, and helix-loop-helix. Proteins containing these 

3 o motifs may act alone as monomers, or they may form homo- or heterodimers that interact with DNA. 

The helix-turn-helix motif consists of two a helices connected at a fixed angle by a short 
chain of amino acids. One of the helices binds to the major groove. Helix-turn-helix motifs are 
exemplified by the homeobox motif which is present in homeodomain proteins. These proteins are 
critical for specifying the anterior-posterior body axis during development and are conserved 
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throughout the animal kingdom. The Antennapedia and Ultrabithorax proteins of Drosophila 
melanoeaster are prototypical homeodomain proteins (Pabo, CO. and R.T. Sauer (1992) Annu. Rev. 
Biochem. 61:1053-1095). 

. The zinc finger motif, which binds zinc ions, generally contains tandem repeats of about 30 
5 amino acids consisting of periodically spaced cysteine and histidine residues. Examples of this 

sequence pattern, designated C2H2 and C3HC4 ("RING" finger), have been described (Lewin, supra) . 
Zinc finger proteins each contain an a helix and an antiparallel B sheet whose proximity and 
conformation are maintained by the zinc ion. Contact with DNA is made by the arginine prece ding 
the a helix and by the second, third, and sixth residues of the a helix. Variants of the zinc finger 

i o motif include poorly defined cysteine-rich motifs which bind zinc or other metal ions. These motifs 
may not contain histidine residues and are generally nonrepetitive. 

The leucine zipper motif comprises a stretch of amino acids rich in leucine which can fonn an 
amphipathic a helix. This structure provides the basis for dimerization of two leucine zipper 
proteins. The region adjacent to the leucine zipper is usually basic, and upon protein dimerization, is 

is optimally positioned for binding to the major groove. Proteins containing such motifs are generally 
referred to as bZBP transcription factors. 

The helix-loop-helix motif (HLH) consists of a short a helix connected by a loop to a longer 
a helix. The loop is flexible and allows the two helices to fold back against each other and to bind to 
DNA. Hie transcription factor Myc contains a prototypical HLH motif. 

20 Most transcription factors contain characteristic DNA binding motifs, and variations on the 

above motifs and new motifs have been and are currently being characterized (Faisst, S. and S. Meyer 
(1992) Nucleic Acids Res. 20:3-26). 

Many neoplastic disorders in humans can be attributed to inappropriate gene expression. 
Malignant cell growth may result from either excessive expression of tumor promoting genes or 

25 insufficient expression of tumor suppressor genes (Cleary, ML. (1992) Cancer Surv. 15:89-104). 
Chromosomal translocations may also produce chimeric loci which fuse the coding sequence of one 
gene with the regulatory regions of a second unrelated gene. Such an arrangement likely results in 
inappropriate gene transcription, potentially contributing to malignancy. 

In addition, the immune system responds to infection or trauma by activating a cascade of 

3 o events that coordinate the progressive selection, amplification, and mobilization of cellular defense 
mechanisms. A complex and balanced program of gene activation and repression is involved in this 
process. However, hyperactivity of the immune system as a result of improper or insufficient 
regulation of gene expression may result in considerable tissue or organ damage. This damage is well 
documented in immunological responses associated with arthritis, allergens, heart attack, stroke, and 
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infections (Isselbacher, KJ. et al. (1996) Harrison's Principles of Internal Medicine. 13/e, McGraw 
Hill, Inc. and Teton Data Systems Software). 

Furthermore, the generation of multicellular organisms is based upon the induction and 
coordination of cell differentiation at the appropriate stages of development Central to this process is 
5 differential gene expression, which confers the distinct identities of cells and tissues throughout the 
body. Failure to regulate gene expression during development can, result in developmental disorders. 
Human developmental disorders caused by mutations in zinc finger-type transcriptional regulators 
include: urogenenital developmental abnormalities associated with WT1 ; Greig 
cephalopolysyndactyly, Pallister-Hall syndrome, and postaxial Polydactyly type A (GLI3); and 
10 Townes-Brocks syndrome, characterized by anal, renal, limb, and ear abnormalities (SALL1) 

(Engelkamp, D. and V. van Heyningen (1996) Curr. Opin. Genet Dev. 6:334-342; Kohlhase, J. et al. 
(1999) Am. J. Hum. Genet 64:435-445). 

Cell Membrane Molecules 

15 Eukaryotic cells are surrounded by plasma membranes which enclose the cell and maintain an 

environment inside the cell that is distinct from its surroundings. In addition, eukaryotic organisms 
are distinct from prokaryotes in possessing many intracellular organelle and vesicle structures. Many 
of the metabolic reactions which distinguish eukaryotic biochemistry from prokaryotic biochemistry 
take place within these structures. The plasma membrane and the membranes surrounding organelles 

20 and vesicles are composed of phosphoglycerides, fatty acids, cholesterol, phospholipids, glycolipids, 
proteoglycans, and proteins. These components confer identity and functionality to the membranes 
with which they associate. 
Integral Membrane Proteins 

The majority of known integral membrane proteins are transmembrane proteins (TM) which 

25 are characterized by an extracellular, a transmembrane, and an intracellular domain. TM domains are 
typically comprised of 15 to 25 hydrophobic amino acids which are predicted to adopt an a-helical 
conformation. TM proteins are classified as bitopic (Types I and II) and polytopic (Types III and IV) 
(Singer, SJ. (1990) Annu. Rev. Cell Biol. 6:247-296). Bitopic proteins span the membrane once 
while polytopic proteins contain multiple membrane-spanning segments. TM proteins function as 

3 o cell-surface receptors, receptor-interacting proteins, transporters of ions or metabolites, ion channels, 
cell anchoring proteins, and cell type-specific surface antigens. 

Many membrane proteins (MPs) contain amino acid sequence motifs that target these proteins 
to specific subcellular sites. Examples of these motifs include PDZ domains, KDEL, RGD, NGR, 
and GSL sequence motifs, von Willebrand factor A (vWFA) domains, and EGF-like domains. RGD, 
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, NGR, and GSL motif-containing peptides have been used as drug delivery agents in targeted cancer 

treatment of tumor vasculature (Arap, W. et al. (1998) Science 279:377-380). Furthermore, MPs may 
also contain amino acid sequence motifs, such as the carbohydrate recognition domain (CRD), that 
mediate interactions with extracellular or intracellular molecules, 
5 G-Protein Coupled Receptors 

G-protein coupled receptors (GPCR) are a superfamily of integral membrane proteins which 
i transduce extracellular signals. GPCRs include receptors for biogenic amines, lipid mediators of 

inflammation, peptide hormones, and sensory signal mediators. The structure of these 
highly-conserved receptors consists of seven hydrophobic transmembrane regions, an extracellular 
10 N-terminus, and a cytoplasmic C-ter minus. Three extracellular loops alternate with three intracellular 
loops to link the seven transmembrane regions. Cysteine disulfide bridges connect the second and 
third extracellular loops. The most conserved regions of GPCRs are the transmembrane regions and 
the first two cytoplasmic loops. A conserved, acidic-Arg-aromatic residue triplet present in the 
second cytoplasmic loop may interact with G proteins. A GPCR consensus pattern is characteristic of 
1 1 5 most proteins belonging to this superfamily (ExPASy PROSITE document PS00237; and Watson, S. 
and S. Arkinstall (1994) Hie G-protein Linked Receptor Facts Book . Academic Press, San Diego CA, 
pp. 2-6). Mutations and changes in transcriptional activation of GPCR-encoding genes have been 
associated with neurological disorders such as schizophrenia, Parkinson* s disease, Alzheimer's 
disease, drug addiction, and feeding disorders. 
20 Scavenger Receptors 

Macrophage scavenger receptors with broad ligand specificity may participate in the binding 
of low density lipoproteins (LDL) and foreign antigens. Scavenger receptors types I and II are 
trimeric membrane proteins with each subunit containing a small N-terminal intracellular domain, a 
transmembrane domain, a large extracellular domain, and a C-terminal cysteine-rich domain. The 

2 5 extracellular domain contains a short spacer region, an a-helicai coiled-coil region, and a triple helical 

collagen-like region. These receptors have been shown to bind a spectrum of ligands, including 
chemically modified lipoproteins and albumin, polyribonucleotides, polysaccharides, phospholipids, 
and asbestos (Matsumoto, A. et al. (1990) Proc. Natl. Acad. Sci. USA 87:9133-9137; andElomaa, O. 
et al, (1995) Cell 80:603-609), The scavenger receptors are thought to play a key role in 

3 o atherogenesis by mediating uptake of modified LDL in arterial walls, and in host defense by binding 

bacterial endotoxins, bacteria, and protozoa. 
Tetraspan Family Proteins 

The transmembrane 4 superfamily (TM4SF) or tetraspan family is a multigene family 
encoding type in integral membrane proteins (Wright, M.D. and M.G. Tomlinson (1994) Immunol. 

100 

BNSDOCID: <WO 0162927A2_I_> 



I 



WO 01/62927 




PCT/US01/06059 



Today 15:588-594). Hie 1M4SF is comprised of membrane proteins which traverse the cell 
membrane four times. Members of the TM4SF include platelet and endothelial cell membrane 
proteins, melanoma-associated antigens, leukocyte surface glycoproteins, colonal carcinoma antigens, 
tumor-associated antigens, and surface proteins of the schistosome parasites (Jankowski, S.A. (1994) 
5 Oncogene 9:1205-121 1). Members of the TM4SF share about 25-30% amino acid sequence identity 
with one another. 

A number of TM4SF members have been implicated in signal transduction, control of cell 
adhesion, regulation of cell growth and proliferation, including development and oncogenesis, and 
cell motility, including tumor cell metastasis. Expression of TM4SF proteins is associated with a 
10 variety of tumors and the level of expression may be altered when cells are growing or activated. 
Tumor Antigens 

Tumor antigens are cell surface molecules that are differentially expressed in tumor cells 
relative to normal cells. Tumor antigens distinguish tumor cells immunologically from normal cells 
and provide diagnostic and therapeutic targets for human cancers (Takagi, S. et al. (1995) Int J. * 
15 Cancer 61:706-715; Liu, E. et al. (1992) Oncogene 7:1027-1032). * * 

Leukocyte Antigens 

Other types of cell surface antigens include those identified on leukocytic cells of the immune 
system. These antigens have been identified using systematic, monoclonal antibody (mAb)-based 
"shot gun" techniques. These techniques have resulted in the production of hundreds of mAbs 

2 o directed against unknown cell surface leukocytic antigens. These antigens have been grouped into 
"clusters of differentiation" based on common immunocytochemical localization patterns in various 
differentiated and undifferentiated leukocytic cell types. Antigens in a given cluster are presumed to 
identify a single cell surface protein and are assigned a "cluster of differentiation" or "CD" 
designation. Some of the genes encoding proteins identified by CD antigens have been cloned and 

2 5 verified by standard molecular biology techniques. CD antigens have been characterized as both 
transmembrane proteins and cell surface proteins anchored to the plasma membrane via covalent 
attachment to fatty acid-containing glycolipids such as glycosylphosphatidylinositol (GPI). 
(Reviewed in Barclay, AN. et al. (1995) The Leucocyte Antigen Facts Book , Academic Press, San 
Diego CA, pp. 17-20.) 

30 Ion Channels 

Ion channels are found in the plasma membranes of virtually every cell in the body. For 
example, chloride channels mediate a variety of cellular functions including regulation of membrane 
potentials and absorption and secretion of ions across epithelial membranes. Chloride channels also 
regulate the pH of organelles such as the Golgi apparatus and endosomes (see, e.g., Greger, R. (1988) 
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Anim. Rev. Physiol. 50: 1 1 1-122). Electrophysiological and pharmacological properties of chloride 
channels, including ion conductance, current-voltage relationships, and sensitivity to modulators, 

suggest that different chloride channels exist in muscles, neurons, fibroblasts, epithelial cells, and 

i 

lymphocytes. ■ 

s Many ion channels have sites for phosphorylation by one or more protein kinases including 

protein kinase A, protein kinase C, tyrosine kinase, and casein kinase n, all of which regulate ion 
channel activity in cells. Inappropriate phosphorylation of proteins in cells has been linked to 
changes in cell cycle progression and cell differentiation. Changes in the cell cycle have been linked 
to induction of apoptosis or cancer. Changes in cell differentiation have been linked to diseases and 
10 disorders of the reproductive system, immune system, skeletal muscle, and other organ systems. 
Proton Pumps 

Proton ATPases comprise a large class of membrane proteins that use the energy of ATP 
hydrolysis to generate an electrochemical proton gradient across a membrane. The resultant gradient 
may be used to transport other ions across the membrane (Na\ K + , or CI*) or to maintain organelle 

15 pH. Proton ATPases are further subdivided into the mitochondrial F-ATPases, the plasma membrane 
ATPases, and the vacuolar ATPases. The vacuolar ATPases establish and maintain an acidic pH 
within various organelles involved in the processes of endocytosis and exocytosis (Mellman, I. et al. 
(1986) Annu. Rev. Biochem. 55:663-700). 

Proton-coupled, 12 membrane-spanning domain transporters such as PEPT 1 and PEPT 2 are 

2 o , responsible for gastrointestinal absorption and for renal reabsorption of peptides using an 

electrochemical HP gradient as the driving force. Another type of peptide transporter, the TAP 
transporter, is a heterodimer consisting of TAP 1 and TAP 2 and is associated with antigen 
processing. Peptide antigens are transported across the membrane of the endoplasmic reticulum by 
TAP so they can be expressed on the cell surface in association with MHC molecules. Each TAP 

2 5 protein consists of multiple hydrophobic membrane spanning segments and a highly conserved. 
ATP-binding cassette (Boll, M. et d. (1996) Proc. Natl. Acad Sci. USA 93:284-289). Pathogenic 
microorganisms, such as herpes simplex virus, may encode inhibitors of TAP-mediated peptide 
transport in order to evade immune surveillance (Marusina, K. and J J Manaco (1996) Curr. Opin. 
Hematol. 3:19-26). 

30 ABC Transporters - 

The ATP-binding cassette (ABC) transporters, also called the "traffic ATPases", comprise a 
superf amily of membrane proteins that mediate transport and channel functions in prokaryotes and 
eukaryotes (Higgins, C.F. (1992) Annu. Rev. Cell Biol. 8:67-113). ABC proteins share a similar 
overall structure and significant sequence homology. All ABC proteins contain a conserved domain 
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of approximately two hundred amino acid residues which includes one or more nucleotide binding 
domains. Mutations in ABC transporter genes are associated with various disorders, such as 
hyperbilirubinemia II/Dubin- Johnson syndrome, recessive Stargardt's disease, X-linked 
adrenoleukodystrophy, multidrug resistance, celiac disease, and cystic fibrosis. 
5 Peripheral and Anchored Membrane Proteins 

Some membrane proteins are not membrane-spanning but are attached to the plasma 
membrane via membrane anchors or interactions with integral membrane proteins. Membrane 
anchors are covalently joined to a protein post-translationally and include such moieties as prenyl, 
myristyl, and glycosylphosphatidyl inositol groups. Membrane localization of peripheral and 
10 anchored proteins is important for their function in processes such as receptor-mediated signal 

transduction. For example, prenylation of Ras is required for its localization to the plasma membrane 
and for its normal and oncogenic functions in signal transduction. 
Vesicle Coat Proteins 

, Intercellular communication is essential for the development and survival of multicellular 

15 organisms. Cells communicate with one another through the secretion and uptake of protein 

signaling molecules. The uptake of proteins into the cell is achieved by the endocytic pathway, in 
which the interaction of extracellular signaling molecules with plasma membrane receptors results in 
the formation of plasma membrane-derived vesicles that enclose and transport the molecules into the • 
cytosol. These transport vesicles fuse with and mature into endosomal and lysosomal (digestive) 
20 compartments. The secretion of proteins from the cell is achieved by exocytosis, in which molecules 
inside of the cell proceed through the secretory pathway. In this pathway, molecules transit from the 
ER to the Golgi apparatus and finally to the plasma membrane, where they are secreted from the cell. 

Several steps in the transit of material along the secretory and endocytic pathways require the 
formation of transport vesicles. Specifically, vesicles form at the transitional endoplasmic reticulum 

2 5 (tER), the rim of Golgi cisternae, the face of the Trans-Golgi Network (TGN), the plasma membrane 

(PM), and tubular extensions of the endosomes. Vesicle formation occurs when a region of 
membrane buds off from the donor organelle. The membrane-bound vesicle contains proteins to be 
transported and is surrounded by a proteinaceous coat, the components' of which are recruited from 
the cytosol. Two different classes of coat protein have been identified. Clathrin coats form on 

3 o vesicles derived from the TGN and PM, whereas coatomer <COP) coats form on vesicles derived 

from the ER and Golgi. COP coats can be further classified as COPI, involved in retrograde traffic 
through the Golgi and from the Golgi to the ER, and COPII, involved in anterograde traffic from the 
ER to the Golgi (Mellman, supra) . 

In clathrin-based vesicle formation, adapter proteins bring vesicle cargo and coat proteins 

103 



BNSDOCID: <WO 01 62927A2_I_> 



WO 01/62927 



PCTYUS01/06059 



together at the surface of the budding.membrane. Adapter protein-1 and -2 select cargo from the 
TGN and plasma membrane, respectively, based on molecular information encoded on the 
cytoplasmic tail of integral membrane cargo proteins. Adapter proteins also recruit clathrin to the bud 
site. Clathrin is a protein complex consisting of three large and three small polypeptide chains 
5 arranged in a three-legged structure called a triskelion. Multiple triskelions and other coat proteins 
appear to self-assemble on the membrane to form a coated pit .This assembly process may serve to 
deform the membrane into a budding vesicle. GTP-bound ADP-ribosylation factor (Arf) is also 
incorporated into the coated assembly. Another small G-protein, dynamin, forms a ring complex 
around the neck of the forming vesicle and may provide the mechanochemical force to seal the bud, 

10 thereby releasing the vesicle. Hie coated vesicle complex is then transported through the cytosol. 
During the transport process, Arf-bound GTP is hydrolyzed to GDP, and the coat dissociates from the 
transport vesicle (West, M.A. et al. (1997) J. Cell Biol. 138:1239-1254). 

Vesicles which bud from the ER and the Golgi are covered with a protein coat similar to the 
clathrin coat of endocytic and TGN vesicles. The coat protein (COP) is assembled from cytosolic 

15 precursor molecules at specific budding regions on the organelle. The COP coat consists of two 

major components, a G-protein (Arf or Sar) and coat protomer (coatomer). Coatomer is an equimolar 
complex of seven proteins, termed alpha-, beta-, beta'-, gamma-, delta-, epsilon- and zeta-COP. The 
coatomer complex binds to dilysine motifs contained on the cytoplasmic tails of integral membrane 
proteins. These include the KKXX retrieval motif of membrane proteins of the ER and 

2 o dibasic/diphenylamine motifs of members of the p24 family. The p24 family of type I membrane 
proteins represent the major membrane proteins of COPI vesicles (Harter, C. and F.T. Wieland (1998) 
Proc. Natl. Acad. Sci. USA 95:11649-11654). 

Organelle Associated Molecules 

2 5 Eukaryotic cells are organized into various cellular organelles which has the effect of 

separating specific molecules and their functions from one another and from the cytosol. Within the 
cell, various membrane structures surround and define these organelles while allowing them to 
interact with one another and the cell environment through both active and passive transport 
processes. Important cell organelles include the nucleus, the Golgi apparatus, the endoplasmic 

3 o reticulum, mitochondria, peroxisomes, lysosomes, endosomes, and secretory vesicles. 

Nucleus 

The cell nucleus contains all of the genetic information of the cell in the form of DNA, and 
the components and machinery necessary for replication of DNA and for transcription of DNA into 
RNA. (See Alberts, B. et al. (1994) Molecular Biology of the Cell . Garland Publishing Inc., New 
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York NY, pp. 335-399.) DNA is organized into compact structures in the nucleus by interactions 
with various DNA-binding proteins such as histones and non-histone chromosomal proteins. 
DNA-specific nucleases, DNAses, partially degrade these compacted structures prior to DNA 
replication or transcription. DNA replication takes place with the aid of DNA helicases which 
5 unwind the double-stranded DNA helix, and DNA polymerases that duplicate the separated DNA 
strands. 

Transcriptional regulatory proteins are essential for the control of gene expression. Some of 
these proteins function as transcription factors that initiate, activate, repress, or terminate gene 
transcription. Transcription factors generally bind to the promoter, enhancer, and upstream 

10 regulatory regions of a gene in a sequence-specific maimer, although some factors bind regulatory 

elements within or downstream of a gene's coding region. Transcription factors may bind to a specific 
region of DNA singly or as a complex with other accessory factors. (Reviewed in Lewin, B. (1990) 
Genes IV . Oxford University Press, New York NY, and Cell Press, Cambridge MA, pp. 554-570.) 
Many transcription factors incorporate DNA-binding structural motifs which comprise either a 

1 5 helices or 6 sheets that bind to the major groove of DNA. Four well-characterized structural motifs 
are helix-turn-helix, zinc finger, leucine zipper, and helix-loop-helix. Proteins containing these 
motifs may act alone as monomers, or they may form homo- or heterodimers that interact with DNA. 

Many neoplastic disorders in humans can be attributed to inappropriate gene expression. 
Malignant cell growth may result from either excessive expression of tumor promoting genes or 

20 insufficient expression of tumor suppressor genes (Cleary, Mi. (1992) Cancer Surv. 15:89-104). 
Chromosomal translocations may also produce chimeric loci which fuse the coding sequence of one 
gene with the regulatory regions of a second unrelated gene. Such an arrangement likely results in 
inappropriate gene transcription, potentially contributing to malignancy. 

In addition, the immune system responds to infection or trauma by activating a cascade of 

2 5 events that coordinate the progressive selection, amplification, and mobilization of cellular defense 
mechanisms. A complex and balanced program of gene activation and repression is involved in this 
process. However, hyperactivity of the immune system as a result of improper or insufficient 
regulation of gene expression may result in considerable tissue or organ damage. This damage is well 
documented in immunological responses associated with arthritis, allergens, heart attack, stroke, and 

30 infections (Isselbacher, KJ. et al. (1996) Harrison's Principles of Internal Medicine. 13/e, McGraw 
Hill, Inc. and Teton Data Systems Software). 

Transcription of DNA into RNA also takes place in the nucleus catalyzed by RNA 
polymerases. Three types of RNA polymerase exist RNA polymerase I makes large ribosomal 
RNAs, while RNA polymerase HI makes a variety of small, stable RNAs including 5S ribosomal 
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RNA and the transfer RNAs (tRNA). RNA polymerase II transcribes genes that will be translated 

i 

into proteins. The primary transcript of RNA polymerase II is called heterogenous nuclear RNA 
(hnRNA), and must be further processed by splicing to remove non-coding sequences called introns. 
RNA splicing is mediated by small nuclear ribonucleoprotein complexes, or snRNPs, producing 
5 mature messenger RNA (mRNA) which is then transported out of the nucleus for translation into 
proteins, 
i Nucleolus 

The nucleolus is a highly organized subcompartment in the nucleus that contains high 
concentrations of RNA and proteins and functions mainly in ribosomal RNA synthesis and assembly 
10 (Alberts, et al. supra, pp. 379-382). Ribosomal RNA (rRNA) is a structural RNA that is complexed 
with proteins to form ribonucleoprotein structures called ribosomes. Ribosomes provide the platform 
on which protein synthesis takes place. 

Ribosomes are assembled in the nucleolus initially from a large, 45S rRNA combined with a 
variety of proteins imported from the cytoplasm, as well as smaller, 5S rRNAs. Later processing of 
1 5 the immature ribosome results in formation of smaller ribosomal subunits which are'transported from 
the nucleolus to the cytoplasm where they are assembled into functional ribosomes. 
Endoplasmic Reticulum 

In eukaryotes, proteins are synthesized within the endoplasmic reticulum (ER), delivered from 
the ER to the Golgi apparatus for post-translational processing and sorting, and transported from the 
20 Golgi to specific intracellular and extracellular destinations. Synthesis of integral membrane proteins, 
secreted proteins, and proteins destined for the lumen of a particular organelle occurs on the rough 
endoplasmic reticulum (ER). The rough ER is so named because of the rough appearance in electron 
micrographs imparted by the attached ribosomes on which protein synthesis proceeds. Synthesis of 
proteins destined for the ER actually begins in the cytosol with the synthesis of a specific signal 

2 5 peptide which directs the growing polypeptide and its attached ribosome to the ER membrane where 

the signal peptide is removed and protein synthesis is completed. Soluble proteins destined far the 
ER lumen, for secretion, or for transport to the lumen of other organelles pass completely into the ER 
lumen. Transmembrane proteins destined for the ER or for other cell membranes are translocated 
across the ER membrane but remain anchored in the lipid bilayer of the membrane by one or more 

3 o membrane-spanning a-helical regions. 

Translocated polypeptide chains destined for other organelles or for secretion also fold and 
assemble in the ER lumen with the aid of certain "resident" ER proteins. Protein folding in the ER is 
aided by two principal types of protein isomerases, protein disulfide isomerase (PDI), and peptidyl- 
prolyl isomerase (PPI). PDI catalyzes the oxidation of free sulfhydryl groups in cysteine residues to 
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form intramolecular disulfide bonds in proteins. PPI, an enzyme that catalyzes the isomerization of 
certain proline imide bonds in oligopeptides and proteins, is considered to govern one of the rate 
limiting steps in the folding of many proteins to their final functional conformation. The cyclophilins 
represent a major class of PPI that was originally identified as the major receptor for the 
5 immunosuppressive drug cyclosporin A (Handschumacher, R.E. et al. (1984) Science 226:544-547). 
Molecular "chaperones" such as BiP (binding protein) in the ER recognize incorrectly folded proteins 
as well as proteins not yet folded into their final form and bind to them, both to prevent improper 
aggregation between them, and to promote proper folding. 

The "N-linked" glycosylation of most soluble secreted and membrane-bound proteins by 

1 o oligosacchrides linked to asparagine residues in proteins is also performed in the ER. This reaction is 

catalyzed by a membrane-bound enzyme, oligosaccharyl transferase. 
Golgi Apparatus 

Hie Golgi apparatus is a complex structure that lies adjacent to the ER in eukaryotic cells and 
, serves primarily as a sorting and dispatching station for products of the ER (Alberts, et al: supra , pp. 
15 600-610). Additional postradiational processing, principally additional glycosylation, also occurs in 
the Golgi. Indeed, the Golgi is a major site of carbohydrate synthesis, including most of the 
glycosaminoglycans of the extracellular matrix. N-linked oligosaccharides, added to proteins in the 
ER, are also further modified in the Golgi by the addition of more sugar residues to form complex N- 
linked oligosaccharides. "CD-linked" glycosylation of proteins also occurs in the Golgi by the addition 
20 of N-acetylgalactosamine to the hydroxyl group of a serine or threonine residue followed by the 
sequential addition of other sugar residues to the first This process is catalyzed by a series of 
glycosyltransferases each specific for a particular donor sugar nucleotide and acceptor molecule 
(Lodish, H. et al. (1995) Molecular Cell Biology . W.H. Freeman and Co., New York NY, pp.700- 
708). In many cases; both N- and O -linked oligosaccharides appear to be required for the secretion of 

2 5 proteins or the movement of plasma membrane glycoproteins to the cell surface. 

The terminal compartment of the Golgi is the Trans-Golgi Network (TGN), where both 
membrane and lumenal proteins are sorted for their final destination. Transport (or secretory) vesicles 
destined for intracellular compartments, such as lysosomes, bud off of the TGN. Other transport 
vesicles bud off containing proteins destined for the plasma membrane, such as receptors, adhesion 

3 o molecules, and ion channels, and secretory proteins, such as hormones, neurotransmitters, and digestive 

enzymes. 
Vacuoles 

The vacuole system is a collection of membrane bound compartments in eukaryotic cells that 
functions in the processes of endocytosis and exocytosis. They include phagosomes, lysosomes, 
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endosomes, and secretory vesicles. Endocytosis is the process in cells of internalizing nutrients, solutes 
or small particles (pinocytosis) or large particles such as internalized receptors, viruses, bacteria, or 
bacterial toxins (phagocytosis). Exocytosis is the process of transporting' molecules to the cell surface. 
It facilitates placement or localization of membrane-bound receptors or other membrane proteins and 
5 secretion of hormones, neurotransmitters, digestive enzymes, wastes, etc. 

A common property of all of these vacuoles is an acidic pH environment ranging from 
approximately pH 4.5-5.0. This acidity is maintained by the presence of a proton ATPase that uses the 
energy of ATP hydrolysis to generate an electrochemical, proton gradient across a membrane (Mellman, 

I. et aL (1986) Annu. Rev. Biochem. 55:663-700). Eukaryotic vacuolar proton ATPase (vp-ATPase) is 

i 

10 a multimeric enzyme composed of 3-10 different subvmits. One of these subunits is a highly 

hydrophobic polypeptide of approximately 16 kDa that is similar to the proteolipid component of vp- 
ATPases from eubact^ria, fungi, and plant vacuoles (Mandel, M. et aL (1988) Proc. NatL Acad. Sci. 
USA 85:5521-5524). The 16 kDa proteolipid component is the major subunit of the membrane portion 
of vp-ATPase and functions in the transport of protons across the membrane. 

15 Lvsosomes 

Lysosomes are membranous vesicles containing various hydrolytic enzymes used for the 
controlled intracellular digestion of macromolecules. Lysosomes contain some 40 types of enzymes 
including proteases, nucleases, glycosidases, lipases, phospholipases, phosphatases, and sulfatases, all 
of which are acid hydrolases that function at a pH of about 5. Lysosomes are surrounded by a unique 
20 membrane containing transport proteins that allow the final products of macromolecule degradation, 
such as sugars, amino acids, and nucleotides, to be transported to the cytosol where they may be 
either excreted or reutilized by the cell. A vp-ATPase, such as that described above, maintains the 
acidic environment necessary for hydrolytic activity (Alberts, supra, pp. 610-61 1). 
Endosomes 

2 5 Endosomes are another type of acidic vacuole that is used to transport substances from the 

cell surface to the interior of the cell in the process of endocytosis. Like lysosomes, endosomes have 
an acidic environment provided by a vp-ATPase (Alberts et al. supra, pp. 610-618). Two types of 
endosomes are apparent based on tracer uptake studies that distinguish their time of formation in the 
cell and their cellular location. Early endosomes are found near the plasma membrane and appear to 

3 o function primarily in the recycling of internalized receptors back to the cell surface. Late endosomes 

appear lata: in the endocytic process close to the Golgi apparatus and the nucleus, and appear to be 
associated with delivery of endocytosed material to lysosomes or to the TGN where they may be 
recycled. Specific proteins are associated with particular transport vesicles and their target 
compartments that may provide selectivity in targeting vesicles to their proper compartments. A 
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cytosolic prenylated GTP-binding protein, Rab, is one such protein. Rabs 4, 5, and 11 are associated 
with the early endosome* whereas Rabs 7 and 9 associate with the late endosome. 
Mitochondria , • 

Mitochondria are oval-shaped organelles comprising an outer membrane, a tightly folded 
5 inner membrane, an intermembrane space between the outer and inner membranes, and a matrix 
inside the inner membrane. The outer membrane contains many porin molecules that allow ions and 
charged molecules to enter the intermembrane space, while the inner membrane contains a variety of 
transport proteins that transfer only selected molecules. Mitochondria are the primary sites of energy 
production in cells. 1 

10 Energy is produced by the oxidation of glucose and fatty acids. Glucose is initially converted 

to pyruvate in the cytoplasm. Fatty acids and pyruvate are transported to the mitochondria for 
complete oxidation to C0 2 coupled by enzymes to the transport of electrons from NADH and FADHj 
to oxygen and to the synthesis of ATP (oxidative phosphorylation) from ADP and Pj. 

Pyruvate is transported into the mitochondria and converted to acetyl-CoA for oxidation via 

15 the citric acid cycle, involving pyruvate dehydrogenase components, dihydrolipoyl transacetylase, and 
dihydrolipoyl dehydrogenase. Enzymes involved in the citric acid cycle include: citrate synthetase, 
aconitases, isocitrate dehydrogenase, alpha-ketoglutarate dehydrogenase complex including 
transsuccinylases, supcinyl CoA synthetase, succinate dehydrogenase, fumarases, and malate 
dehydrogenase. Acetyl CoA is oxidized to C0 2 with concomitant formation of NADH, FADH 2 , and 

2 o GTP. In oxidative phosphorylation, the transfer of electrons from NADH and FADH 2 to oxygen by 

dehydrogenases is coupled to the synthesis of ATP from ADP and P i by the FqFj ATPase complex in 
the mitochondrial inner membrane. Enzyme complexes responsible for electron transport and ATP 
synthesis include the F^ ATPase complex, ubiquinone(CoQ)-cytochrome c reductase, ubiquinone 
reductase, cytochrome b, cytochrome FeS protein, and cytochrome c oxidase. 
25 Peroxisomes 

Peroxisomes, like mitochondria, are a major site of oxygen utilization. They contain one or 
more enzymes, such as catalase and urate oxidase, that use molecular oxygen to remove hydrogen 
atoms from specific organic substrates in an oxidative reaction that produces hydrogen peroxide 
(Alberts, supra , pp. 574-577). Catalase oxidizes a variety of substrates including phenols, formic 

3 o acid, formaldehyde, and alcohol and is important in peroxisomes of liver and kidney cells for 

detoxifying various toxic molecules that enter the bloodstream. Another major function of oxidative 
reactions in peroxisomes is the breakdown of fatty acids in a process called (J oxidation. P oxidation 
results in shortening of the alkyl chain of fatty acids by blocks of two carbon atoms that are converted 
to acetyl CoA and exported to the cytosol for reuse in biosynthetic reactions. 
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Also like mitochondria, peroxisomes import their proteins from the cytosol using a specific 
signal sequence located near the C -terminus of the protein; The importance of this import process is 
evident in the inherited human disease Zellweger syndrome, in which a defect in importing proteins 
into perixosomes leads to a perixosomal deficiency resulting in severe abnormalities in the brain, 
5 liver, and kidneys, and death soon after birth. One form of this disease has been shown to be due to a 
mutation in the gene encoding a perixosomal integral membrane protein called peroxisome assembly 
factor-1. 

The discovery of new human molecules satisfies a need in the art by providing new 
compositions which are useful in the diagnosis, study, prevention, and treatment of diseases associated 
10 with, as well as effects of exogenous compounds on, the expression of human molecules. 

SUMMARY OF THE INVENTION 

i 

The present invention relates to nucleic acid sequences comprising human diagnostic and 
therapeutic polynucleotides (dithp) as presented in the Sequence Listing. The ditbp uniquely identify 

,15 genes encoding human structural, functional, and regulatory molecules. 

The invention provides an isolated polynucleotide comprising a polynucleotide sequence 
selected from the group consisting of a) a polynucleotide sequence selected from the group consisting of 
SEQ ID NO:l-21 1; b) a naturally occurring polynucleotide sequence having at least 90% sequence • 
identity to a polynucleotide sequence selected from the group consisting of SEQ ID NO:l-21 1; c) a 

20 polynucleotide sequence complementary to a); d) a polynucleotide sequence complementary to b); and 
e) an RNA equivalent of a) through d). In one alternative, the polynucleotide comprises a 
polynucleotide sequence selected from the group consisting of SEQ ID NO: 1-211. In another 
alternative, the polynucleotide comprises at least 60 contiguous nucleotides of a polynucleotide 
sequence selected from the group consisting of a) a polynucleotide sequence selected from the group 

25 consisting of SEQ ID NO:l-21 1; b) a naturally occurring polynucleotide sequence having at least 90% 
sequence identity to a polynucleotide sequence selected from the group consisting of SEQ ID NO:l- 
21 1; c) a polynucleotide sequence complementary to a); d) a polynucleotide sequence complementary to 
b); and e) an RNA equivalent of a) through d). The invention further provides a composition for the 
detection of expression of human diagnostic and therapeutic polynucleotides comprising at least one 

3 o isolated polynucleotide comprising a polynucleotide sequence selected from the group consisting of a) a 
polynucleotide sequence selected from the group consisting of SEQ ID NO:l-211; b) a naturally 
occurring polynucleotide sequence having at least 90% sequence identity to a polynucleotide sequence 
selected from the group consisting of SEQ ID NO:l-211; c) a polynucleotide sequence complementary 
to a); d) a polynucleotide sequence complementary to b); and e) an RNA equivalent of a) through d); 
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and a detectable label. 

Tbe invention also provides a method for detecting a target polynucleotide in a sample, said 
target polynucleotide comprising a polynucleotide sequence selected from the group consisting of a) a 
polynucleotide sequence selected from the group consisting of SEQ ID NO:l-211; b) a naturally 
5 occurring polynucleotide sequence having at least 90% sequence identity to a polynucleotide sequence 
selected from the group consisting of SEQ ID NO:l-21 1; c) a polynucleotide sequence complementary 
to a); d) a polynucleotide sequence complementary to b); and e) an RNA equivalent of a) through <$). 
The method comprises a) amplifying said target polynucleotide or a fragment thereof using polymerase 
chain reaction amplification, and b) detecting the presence or absence of said amplified target 

1 o polynucleotide or fragment thereof, and, optionally, if present, the amount thereof. 

The invention also provides a method for detecting a target polynucleotide in a sample, said 
target polynucleotide comprising a polynucleotide sequence selected from the group consisting of a) a 
polynucleotide sequence selected from the group consisting of SEQ ID NO:l-21 1 ; b) a naturally 
occurring polynucleotide sequence having at least 90% sequence identity to a polynucleotide sequence 

15 selected from the group consisting of SEQ ID NO:l-21 1; c) a polynucleotide sequence complementary 
to a); d) a polynucleotide sequence complementary to b); and e) an RNA equivalent of a) through d). 
The method comprises a) hybridizing the sample with a probe comprising at least 20 contiguous 
nucleotides comprising a sequence complementary to said target polynucleotide in the sample, and 
which probe specifically hybridizes to said target polynucleotide, undo* conditions whereby a 

20 hybridization complex is formed between said probe and said target polynucleotide, and b) detecting the 
presence or absence of said hybridization complex, and, optionally, if present, the amount thereof. In 
one alternative, the probe comprises at least 30 contiguous nucleotides. In another alternative, the 
probe comprises at least 60 contiguous nucleotides. 

The invention further provides a recombinant polynucleotide comprising a promoter sequence 

2 5 operably linked to an isolated polynucleotide comprising a polynucleotide sequence selected from the 

group consisting of a) a polynucleotide sequence selected from the group consisting of SEQ ID NO:l- 
21 1 ; b) a naturally occurring polynucleotide sequence having at least 90% sequence identity to a 
polynucleotide sequence selected from the group consisting of SEQ ID NO:l-21 1; c) a polynucleotide 
sequence complementary to a); d) a polynucleotide sequence complementary to b); and e) an RNA 

3 o equivalent of a) through d). In one alternative, the invention provides a cell transformed with the 

recombinant polynucleotide. In another alternative, the invention provides a transgenic organism 
comprising the recombinant polynucleotide. In a further alternative, the invention provides a method 
for producing a human diagnostic and therapeutic polypeptide, the method comprising a) x^ilturing a 
cell under conditions suitable for expression of the human diagnostic and therapeutic polypeptide, 

111 



BNSDOC1D: <WO 01 62927A2_I_> 



WO 01/62927 




PCTYUS01/06059 



wherein said cell is transformed with the recombinant polynucleotide, and b) recovering the human 
diagnostic and therapeutic polypeptide so expressed 

The invention also provides a purified human diagnostic and therapeutic polypeptide (DITHP) 
encoded by at least one polynucleotide comprising a polynucleotide sequence selected from the group 
5 consisting of SEQ ID NO:l-21 1 . Additionally, the invention provides an isolated antibody which 
specifically binds to the human diagnostic and therapeutic polypeptide. The invention further provides 
a method of identifying a test compound which specifically binds to the human diagnostic and 
therapeutic polypeptide, the method comprising the steps of a) providing a test compound; b) combining 
the human diagnostic and therapeutic polypeptide with the test compound for a sufficient time and 

10 under suitable conditions for binding; and c) detecting binding of the human diagnostic and therapeutic 
polypeptide to the test compound, thereby identifying the test compound which specifically binds the 
human diagnostic and therapeutic, polypeptide. 

The invention further provides a microarray wherein at least one element of the microarray is 
an isolated polynucleotide comprising at least 60 contiguous nucleotides of a polynucleotide comprising 

15 a polynucleotide sequence selected from the group consisting of a) a polynucleotide sequence selected 
from the group consisting of SEQ ID NO:l-21 1 ; b) a naturally occurring polynucleotide sequence 
having at least 90% sequence identity to a polynucleotide sequence selected from the group consisting 
of SEQ ID NO:l-211; c) a polynucleotide sequence complementary to a); d) a polynucleotide sequence 
complementary to b); and e) an RNA equivalent of a) through d). The invention also provides 1 a method 

20 for generating a transcript image of a sample which contains polynucleotides. The method comprises a) 
labeling the polynucleotides of the sample, b) contacting the elements of the microarray with the labeled 
polynucleotides of the sample under conditions suitable for the formation of a hybridization complex, 
and c) quantifying the expression of the polynucleotides in the sample. 

Additionally, the invention provides a method for screening a compound for effectiveness in 

2 5 altering expression of a target polynucleotide, wherein said target polynucleotide comprises a 

polynucleotide sequence selected from the group consisting of a) a polynucleotide sequence selected 
from the group consisting of SEQ ID NO:l-21 1 ; b) a naturally occurring polynucleotide sequence 
having at least 90% sequence identity to a polynucleotide sequence selected from the group consisting 
of SEQ ID NO;l-21 1 ; c) a polynucleotide sequence complementary to a); d) a polynucleotide sequence 

30 complementary to b); and e) an RNA equivalent of a) through d). The method comprises a) exposing a 
sample comprising the target polynucleotide to a compound, and b) detecting altered expression of the 
target polynucleotide, and c) comparing the expression of the target polynucleotide in the presence of 
varying amounts of the compound and in the absence of the compound. 

The invention further provides a method for assessing toxicity of a test compound, said method 
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comprising a) treating a biological sample containing nucleic acids with the test compound; b) 
hybridizing the nucleic acids of the treated biological sample with a probe comprising at least 20 
contiguous nucleotides of a polynucleotide comprising a polynucleotide sequence selected from the 
group consisting of i) a polynucleotide sequence selected from the group consisting of SEQ ID NO:l- 
5 21 1 ; ii) a naturally occurring polynucleotide sequence having at least 90% sequence identity to a 

polynucleotide sequence selected from the group consisting of SEQ ID NO:l-211; iii) a polynucleotide 
sequence complementary to i), iv) a polynucleotide sequence complementary to ii), and v) an RNA 
equivalent of i)-iv). Hybridization occurs under ccmditions whereby a specific hybridization complex 
is formed between said probe and a target polynucleotide in the biological sample, said target 
l o polynucleotide comprising a polynucleotide sequence selected from the group consisting of i) a 
polynucleotide sequence selected from the group consisting of SEQ ID NO: 1-21 1; ii) a naturally 
occurring polynucleotide sequence having at least 90% sequence identity to a polynucleotide 
sequence selected from the group consisting of SEQ ID NO:l-21 1 ; iii) a polynucleotide sequence 

1 complementary to i), iv) a polynucleotide sequence complementary to ii), and v) an RNA equivalent 
1 5 of i)-iv), and alternatively, the target polynucleotide comprises a fragment of a polynucleotide sequence 

selected from the group consisting of i-v above; c) quantifying the amount of hybridization complex; 
and d) comparing the amount of hybridization complex in the treated biological sample with the amount . 
of hybridization complex in an untreated biological sample, wherein a difference in the amount of 
hybridization complex in the treated biological sample is indicative of toxicity of the test compound 

2 o The invention further provides an isolated polypeptide comprising an amino acid sequence 

selected from the group consisting of a) an amino acid sequence selected from the group consisting of 
SEQ ID NO:2 12-422, b) a naturally occurring amino acid sequence having at least 90% sequence 
identity to an amino acid sequence selected from the group consisting of SEQ ID NO:212-422, c) a 
biologically active fragment of an amino acid sequence selected from the^group consisting of SEQ ID 
2 5 NO:212-422, and d) an immunogenic fragment of an amino acid sequence selected from the group 
consisting of SEQ ID NO:212-422. In one alternative, the invention provides an isolated polypeptide 
comprising the amino acid sequence of SEQ ID NO:212-422. 

DESCRIPTION OF THE TABLES 

30 Table 1 shows the sequence identification numbers (SEQ ID NO:s) and template identification 

numbers (template IDs) corresponding to the polynucleotides of the present invention, along with their 
GenBank hits (GI Numbers), probability scores, and functional annotations corresponding to the 
GenBankhits. 

Table 2 shows the sequence identification numbers (SEQ ID NO:s) and template identification 
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numbers (template IDs) corresponding to the polynucleotides of the present invention, along with 
polynucleotide segments' of each template sequence as defined by the indicated "start" and "stop" 
nucleotide positions. The reading frames of the polynucleotide segments and the Pfam hits, Pfam 
descriptions, and E- values corresponding to the polypeptide domains encoded by the polynucleotide 
5 segments are indicated 

Table 3 shows the sequence identification numbers (SEQ ID NO:s) and template identification 
numbers (template IDs) corresponding to the polynucleotides of the present invention, along with 
polynucleotide segments of each template sequence as defined by the indicated "start" and "stop" 
nucleotide positions. The reading frames of the polynucleotide segments are shown, and the 

10 polypeptides encoded by the polynucleotide segments constitute either signal peptide (SP) or 

transmembrane (TM) domains, as indicated. The membrane topology of the encoded polypeptide 
sequence is indicated, the N-terminus (N) listed as being oriented to either the cytosolic (in) or non- 
cytosolic (out) side of the cell membrane or organdie. 

Table 4 shows the sequence identification numbers (SEQ ID NO:s) corresponding to the 

is polynucleotides of the present invention, along with component sequence identification numbers 
(component IDs) corresponding to each template. The component sequences, which were used to 
assemble the template sequences, are defined by the indicated "start" and "stop" nucleotide positions 
along each template. 

Table 5 shows the tissue distribution profiles for the templates of the invention. 

20 Table 6 shows the sequence identification numbers (SEQ ID NO:s) corresponding to the 

polypeptides of the present invention, along with the reading frames used to obtain the polypeptide 
segments, the lengths of the polypeptide segments, the "start" and "stop" nucleotide positions of the 
polynucleotide sequences used to define the encoded polypeptide segments, the GenBank hits (GI 
Numbers), probability scores, and functional annotations corresponding to the GenBank hits. 

2 5 Table 7 summarizes the bioinformatics tools which are useful for analysis of the 

polynucleotides of the present invention. The first column of Table 7 lists analytical tools, programs, 
and algorithms, the second column provides brief descriptions thereof, the third column presents 
appropriate references, all of which are incorporated by reference herein in their entirety, and the fourth 
column presents, whore applicable, the scores, probability values, and other parameters used to evaluate 

30 the strength of a match between two sequences (the higher the score, the greater the homology between 
two sequences). 

DETAILED DESCRIPTION OF THE INVENTION 

Before the nucleic acid sequences and methods are presented, it is to be understood that this 
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invention is not limited to the particular machines, methods, and materials described. Although 
particular embodiments are described, machines, methods, and materials similar or equivalent to these 
embodiments may bfc used to practice the invention. The preferred machines, methods, and materials 
set forth are not intended to limit the scope of the invention which is limited only by the appended 
5 claims. • ( 

The singular forms "a", "an", and "the" include plural reference unless the context clearly 
dictates otherwise. All technical and scientific terms have the meanings commonly understood by one 
of ordinary skill in the art All publications are incorporated by reference for the purpose of describing 
and disclosing the cell lines, vectors, and methodologies which are presented and which might be used in 

1 o connection with the inventioa Nothing in the specification is to be construed as an admission that the 

invention is not entitled to antedate such disclosure by virtue of prior inventioa 

i 

Definitions 

As used herein, the lower case "dithp" refers to a nucleic acid sequence, while the upper case 
15 "DITHF' refers to an amino acid sequence encoded by dithp. A "full-length" dithp refers to a nucleic 
acid sequence containing the entire coding region of a gene endogenously expressed in human tissue. 

"Adjuvants" are materials such as Freund's adjuvant, mineral gels (aluminum hydroxide), and 
surface active substances (lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, keyhole 
limpet hemocyanin, and dinitrophenol) which may be administered to increase a host's immunological 
20 response. 

"Allele" refers to an alternative form of a nucleic acid sequence. Alleles result from a 
"mutation," a change or an alternative reading of the genetic code. Any given gene may have none, one, 
or many allelic forms. Mutations which give rise to alleles include deletions, additions, or substitutions 
of nucleotides. Each of these changes may occur alone, or in combination with the others, one or more 

2 5 times in a given nucleic acid sequence. The present invention encompasses allelic dithp. 

"Amino acid sequence" refers to a peptide, a polypeptide, or a protein of either natural or 
synthetic origin. The amino acid sequence is not limited to the complete, endogenous amino acid 
sequence and may be a fragment, epitope, variant, or derivative of a protein expressed by a nucleic acid 
sequence. 

30 "Amplification" refers to the production of additional copies of a sequence and is carried out 

using polymerase chain reaction (PCR) technologies well known in the art. 

"Antibody" refers to intact molecules as weQ as to fragments thereof, such as Fab, F(ab% and 
Fv fragments, which are capable of binding the epitopic determinant. Antibodies that bind DITHP 
polypeptides can be prepared using intact polypeptides or using fragments containing small peptides of 
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( interest as the immunizing antigen. The polypeptide or peptide used to immunize an animal (e.g., a 
mouse, a rat, or a rabbit) can be derived from the translation of RNA, or synthesized chemically, and 
can be conjugated to a carrier protein if desired Commonly used carriers that are chemically coupled 
to peptides include bovine serum albumin, thyroglobulin, and keyhole limpet hemocyanin (KLH). The 
5 coupled peptide is then used to immunize the animal. 

"Antisense sequence" refers to a sequence capable of specifically hybridizing to a target 
' sequence. The antisense sequence may include DNA, RNA, or any nucleic acid mimic or analog such 

as peptide nucleic acid (PNA); oligonucleotides having modified backbone linkages such as , 
phosphorothioates, methylphosphonates, or benzylphosphonates; oligonucleotides having modified 
10 sugar groups such as 2-methoxyethyl sugars or 2'-methoxyethoxy sugars; or oligonucleotides having 
modified bases such as 5-methyl cytosine, y-deoxyuracil, or 7-deaza-2'-deoxyguanosine. 

"Antisense sequence" refers to a sequence capable of specifically hybridizing to a target 
sequence. The antisense sequence can be DNA, RNA, or any nucleic acid mimic or analog. 

"Antisense technology" refers to any technology which relies on the specific hybridization of an 
is antisense sequence to a target sequence. 

A "bin" is a portion of compute memory space used by a compute" program for storage of 
data, and bounded in such a manner that data stored in a bin may be retrieved by the program. 
. "Biologically active" refers to an amino acid sequence having a structural, regulatory, or 

biochemical function of a naturally occurring amino acid sequence. 
20 "Clone joining" is a process for combining gene bins based upon the bins* containing sequence 

. information from the same clone. The sequences may assemble into a primary gene transcript as well 
as one or more splice variants. 

"Complementary" describes the relationship between two single-stranded nucleic acid 
sequences that anneal by base-pairing (5'-A-G-T-3' pairs with its complement 3-T-C-A-5 1 ). 
25 A "component sequence" is a nucleic acid sequence selected by a computer program such as 

PHRED and used to assemble a consensus or template sequence from one or more component 
sequences. 

A "consensus sequence" or "template sequence" is a nucleic acid sequence which has been 
assembled from overlapping sequences, using a computer program for fragment assembly such as the 
3 o GELVIEW fragment assembly system (Genetics Compute" Group (GCG), Madison WI) or using a 
relational database management system (RDMS). 

"Conservative amino acid substitutions" are those substitutions that, when made, least interf ere 
with the properties of the original protein, i.e., the structure and especially the function of the protein is 
conserved and not significantly changed by such substitutions. The table below shows amino acids 
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which may be substituted for an original amino acid in a protein and which are regarded as conservative 
substitutions. 



Original Residue 


Conservative Substitution 


Ala 


Gly, Ser 


Are 


His, Lys 


Asn 


Asp, Gin, His 


Asp 


Asn, Glu 


Cys 


Ala, Ser 


Gin 


Asn, Glu, His 


Glu 


Asp, Gin, His 


Gly 


Ala 


His 


Asn, Arg, Gin, Glu 


He 


Leu, Val 


Leu 


lie, Val 


Lys 


Arg, Gin, Glu 


Met 


Leu, He 


Phe 


His, Met, Leu, Trp, Tyr 


Ser 


Cys, Thr 


Thr 


Ser, Val 


Trp 


Phe, Tyr 


Tyr 


His, Phe, Trp 


Val 


He, Leu, Thr 



Conservative substitutions generally maintain (a) the structure of the polypeptide backbone in 
the area of the substitution, for example, as a beta sheet or alpha helical conformation, (b) the charge or 
hydrophobicity of the molecule at the target site, or (c) the bulk of the side chain. 

"Deletion" refers to a change in either a nucleic or amino acid sequence in which at least one 
nucleotide or amino acid residue, respectively, is absent 

"Derivative" refers to the chemical modification of a nucleic acid sequence, such as by 
replacement of hydrogen by an alkyl, acyl, amino, hydroxyl, or other group. 

The terms "element" and "array element" refer to a polynucleotide, polypeptide, or other 
chemical compound having a unique and defined position on a microarray. 

"E-value" refers to the statistical probability that a match between two sequences occurred by 

chance. 

A "fr agmenf 9 is a unique portion of dithp or DITHP which is identical in sequence to but 
shorter in length than the parent sequence. A fragment may comprise up to the entire length of the 
defined sequence, minus one nucleotide/amino acid residue. For example, a fragment may comprise 
from 10 to 1000 contiguous amino acid residues or nucleotides. A fragment used as a probe, primer, 
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antigen, therapeutic molecule, or for other purposes, may be at least 5, 10, 15, 16, 20, 25, 30, 40, 50, 
60, 75, 100, 150, 250 or at least 500 contiguous amino acid residues or nucleotides in length 
Fragments may be preferentially selected from certain regions of a molecule. For example, a 
polypeptide fragment may comprise a certain length of contiguous amino acids selected from the first 
5 250 or 500 amino acids (or first 25% or 50%) of a polypeptide as shown in a certain defined sequence. 
Clearly these lengths are exemplary, and any length that is supported by the specification, including the 
Sequence Listing and the figures', may be encompassed by the present embodiments. 

A fragment of ditbp comprises a regiori of unique polynucleotide sequence that specifically 
identifies dithp, for example, as distinct from any other sequence in the same genome. A fragment of 

10 dithp is useful, for example, in hybridization and amplification technologies and in analogous methods 
that distinguish ditbp from related polynucleotide sequences. The precise length of a fragment of dithp 
and the region of dithp to which the fragment corresponds are routinely determinable by one of ordinary 
skill in the art based on the intended purpose for the fr agmerit 

A fragment of DITHP is encoded by a fragment of ditbp. A fragment of DITHP comprises a 

1 5 region of unique amino acid sequence that specifically identifies DITHP. For example, a fragment of 
DITHP is useful as an immunogenic peptide for the development of antibodies that specifically 
recognize DITHP. The precise length of a fragment of DITHP and the region of DITHP to which the 
fragment corresponds are routinely determinable by one of ordinary skill in the art based on the intended 
purpose for the fragment. 

20 A "full length" nucleotide sequence is one containing at least a start site for translation to a 

protein sequence, followed by an open reading frame and a stop site* and encoding a "full length" 
polypeptide. 

"Hit" refers to a sequence whose annotation will be used to describe a given template. Criteria 
for selecting the top hit are as follows: if the template has one or more exact nucleic acid matches, the 
2 5 top hit is the exact match with highest percent identity. If the template has no exact matches but has 
significant protein hits, the top hit is the protein hit with the lowest E-value. If the template has no 
significant protein hits, but does have significant non-exact nucleotide hits, the top hit is the nucleotide 
hit with the lowest E-value. 

"Homology" refers to sequence similarity either between a reference nucleic acid sequence and 
30 at least a fragment of a dithp or between a reference amino acid sequence and a fragment of a DITHP. 

"Hybridization" refers to the process by which a strand of nucleotides anneals with a 
complementary strand through base pairing. Specific hybridization is an indication that two nucleic 
acid sequences share a high degree of identity. Specific hybridization complexes form under defined 
annealing conditions, and remain hybridized after the "washing" step. The defined hybridization 
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conditions include the annealing conditions and the washing step(s), the latter of which is particularly 
important in determining the stringency of the hybridization process, with more stringent conditions 
allowing less non-specific binding, i.e., binding between pairs of nucleic acid probes that are not 
perfectly matched. Permissive conditions for annealing of nucleic acid sequences are routineLy 
5 determinable and may be consistent among hybridization experiments, whereas wash conditions may be 
varied among experiments to achieve the desired stringency. 

Generally, stringency of hybridization is expressed with reference to the temperature undo* 
which the wash step is carried out Generally, such wash temperatures are selected to be about 5°C to 
20°C lower than the thermal melting point (J^ for the specific sequence at a defined ionic strength and 

10 pH. The T m is the temperature (undo: defined ionic strength and pH) at which 50% of the target 

sequence hybridizes to a perfectly matched probe. An equation for calculating T m and conditions for 
nucleic acid hybridization is well known and can be found in Sambrook et al., 1989, Molecular 
Cloning: A Laboratory Manual . 2 nd ed., vol. 1-3, Cold Spring Harbor Press, Plainview NY; specifically 

' see volume 2, chapter 9, 

15 High stringency conditions for hybridization between polynucleotides of the present invention 

include wash conditions of 68°C in the presence of about 0.2 x SSC and about 0.1% SDS, for 1 hour. 
Alternatively, temperatures of about 65°C, 60°C, or 55°C may be used. SSC concentration may be 
varied from about 0.2 to 2 x SSC, with SDS being present at about 0.1%. Typically, blocking reagents 
are used to block non-specific hybridization. Such blocking reagents include* for instance, denatured 

2 o salmon sperm DNA at about 100-200 ng/ml. Useful variations on these conditions will be readily 
apparent to those skilled in the art. Hybridization, particularly under high stringency conditions, may 
be suggestive of evolutionary similarity between the nucleotides. Such similarity is strongly indicative 
of a similar role for the nucleotides and their resultant proteins. 

Other parameters, such as temperature, salt concentration; and detergent concentration may be 

2 5 varied to achieve the desired stringency. Denaturants, such as formamide at a concentration of about 

35-50% v/v, may also be used under particular circumstances, such as RNA:DNA hybridizations. 
Appropriate hybridization conditions are routinely determinable by one of ordinary skill in the art. 

"Immunogenic" describes the potential for a natural, recombinant, or synthetic peptide, epitope, 
polypeptide, or protein to induce antibody production in appropriate animals, ceils, or cell lines. 

3 o "Insertion" or "addition" refers to a change in either a nucleic or amino acid sequence in which 

at least one nucleotide or residue, respectively, is added to the sequence. 

"Labeling" refers to the covalent or noncovalent joining of a polynucleotide, polypeptide, or 
antibody with a reporter molecule capable of producing a detectable or measurable signal. 

119 



BNSDOCID: <WO 01 62927A2_I_> 



WO 01/62927 



PCT/US01/06059 



"Microarray" is any arrangement of nucleic acids, amino acids, antibodies, etc., on a substrate. 

The substrate may be a solid support such as beads, glass, papa*, nitrocellulose, nylon, or an 

appropriate membrane. . 

i t • 

"Linkers" are short stretches of nucleotide sequence which may be added to a vector or a dithp 

5 to create restriction endonuclease sites to facilitate cloning. "Polylinkers" are engineered to incorporate 

multiple restriction enzyme sites and to provide for the use of enzymes which leave 5' or, 3* overhangs 

(ag., BamHI, EcoRI, and Hindin) and those which provide blunt ends (e.g., EcoRV, SnaBI, and StuI). 

i 

"Naturally occurring" refers to an endogenous polynucleotide or polypeptide that may be 
isolated from viruses or prokaryotic or eukaryotic ceils. 

10 "Nucleic acid sequence" refers to the specific order of nucleotides joined by phosphodiester 

. bonds in a linear, polymeric arrangement Depending on the number of nucleotides, the nucleic acid 
. sequence can be considered an oligomer, oligonucleotide, or polynucleotide. The nucleic acid can be 
DNA, RNA, or any nucleic acid analog, such as PNA, may be of genomic or synthetic origin, may be 
either double-stranded or single-stranded, and can represent either the sense or antisense 

15 (complementary) strand. 

"Oligomer" refers to a nucleic acid sequence of at least about 6 nucleotides and as many as 
about 60 nucleotides, preferably about 15 to 40 nucleotides, and most preferably between about 20 and 
30 nucleotides, that may be used in hybridization or amplification technologies. Oligomers may be used 
as, e.g., primers for PCR, and are usually chemically synthesized. 

2 o "Operably linked" refers to the situation in which a first nucleic acid sequence is placed in a 

functional relationship with the second nucleic acid sequence. For instance, a promoter is operably 
linked to a coding sequence if the promoter affects the transcription or expression of the coding 
sequence. Generally, operably linked DNA sequences may be in close proximity or contiguous and, 
where necessary to join two protein coding regions, in the same reading frame. 

2 s 'Teptide nucleic acid" (PNA) refers to a DNA mimic in which nucleotide bases are attached to 

a pseudopeptide backbone to increase stability. PNAs, also designated antigene agents, can prevent 
gene expression by targeting complementary messenger RNA 

The phrases "percent identity" and "% identity", as applied to polynucleotide sequences, refer 
to the percentage of residue matches between at least two polynucleotide sequences aligned using a 

3 o standardized algorithm Such an algorithm may insert, in a standardized and reproducible way, gaps in 

the sequences being compared in order to optimize alignment between two sequences, and therefore 
achieve a more meaningful comparison of the two sequences. 

Pa-cent identity between polynucleotide sequences may be determined using the default 
parameters of the CLUSTAL V algorithm as incorporated into the MEGALIGN version 3. 12e sequence 
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alignment program. This program is part of the LASERGENE software package, a suite of molecular 
biological analysis programs. (DNASTAR, Madison WI). CLUSTAL V is described in Higgins, D.G. 
and Sharp, P.M, (1989) CABIOS 5:151-153 and in Higgins, D.G. et al. (1992) CABIOS 8:189-191. 
For pairwise alignments of polynucleotide sequences, the default parameters are set as follows: 
5 Ktuple=2, gap penalty=5, window=4, and "diagonals saved"=4. , The "weighted" residue weight table is 
selected as the default Percent identity is reported by CLUSTAL V, as the "percent similarity" between 
aligned polynucleotide sequence pairs. 

Alternatively, a suite of commonly used and freely available sequence comparison algorithms is 
provided by the National Center for Biotechnology Information (NCBI) Basic Local Alignment Search 
10 Tool (BLAST) (Altschul, S.F. et al. (1990) J. MoL Biol. 215:403-410), which is available from several 
sources, including the NCBI, Bethesda, MD, and on the Internet at 

httpy/www.ncbi.nlm.nih.gov/BLAST/. The BLAST software suite includes various sequence analysis 
programs including "blastn," that is used to determine alignment between a known polynucleotide 
sequence and other sequences on a variety of databases. Also available is a tool called "BLAST 2 
1 5 Sequences" that is used for direct pairwise comparison of two nucleotide sequences. "BLAST 2 
Sequences" can be accessed and used interactively at http-y/www.ncbi.rilm.nihgov/gor£^bl2/. The 
"BLAST 2 Sequences" tool can be used for both blastn and blastp (discussed below). BLAST 
programs are commonly used with gap and other parameters set to default settings. For example, to 
compare two nucleotide sequences, one may use blastn with the "BLAST 2 Sequences" tool Version 
20 2.0.9 (May-07-1999) set at default parameters. Such default parameters may be, for example: 

Matrix: BLOSUM62 

Reward for match: 1 

Penalty for mismatch: -2 

Open Gap: 5 and Extension Gap: 2 penalties 
2 5 Gap x drop-off: 50 

Expect: 10 

Word Size: 11 

Filter: on 

Percent identity may be measured over the length of an entire defined sequence, for example, as 
30 defined by a particular SEQ ID number, or may be measured over a shorter length, for example, over 
the length of a fragment taken from a larger, defined sequence, for instance, a fragment of at least 20, at 
least 30, at least 40, at least 50, at least 70, at least 100, or at least 200 contiguous nucleotides. Such 
lengths are exemplary only, and it is understood that any fragment length supported by the sequences 
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t shown herein, in figures or Sequence Listings, may be used to describe a length over which percentage 
identity may be measured. 

Nucleic acid sequences that do not show a high degree of identity may nevertheless encode 
similar amino acid sequences due to the degeneracy of the genetic code. It is understood that changes in 
5 nucleic acid sequence can be made using this degeneracy to produce multiple nucleic acid sequences 
that all encode substantially the same protein. 

The phrases percent identity" and "% identity", as applied to polypeptide sequences, refer to 
the percentage of residue matches between at least two polypeptide sequences aligned using a 
standardized algorithm. Methods of polypeptide sequence alignment are well-known. Some alignment 
10 methods take into account conservative amino acid substitutions. Such conservative substitutions, 
explained in more detail above, generally preserve the hydrophobicity and acidity of the substituted 
residue, thus preserving the structure (and therefore function) of the folded polypeptide; 

Percent identity between polypeptide sequences may be determined using the default parameters 
of the CLUSTAL V algorithm as incorporated into the MEGALIGN version 3. 12e sequence alignment 
15 program (described and referenced above). For pairwise alignments of polypeptide sequences using 
CLUSTAL V, the default parameters are set as follows: Ktuple=l, gap penalty=3, window=5, and 
"diagonals saved"=5. The PAM250 matrix is selected as the default residue weight table. As with 
polynucleotide alignments, the percent identity is reported by CLUSTAL V as the "percent similarity" 
between aligned polypeptide sequence pairs. 
20 Alternatively the NCBI BLAST software suite may be used. For example, for a pairwise 

comparison of two polypeptide sequences, one may use the "BLAST 2 Sequences" tool Version 2.0.9 
(May-07-1999) with blastp set at default parameters. Such default parameters may be, for example: 
Matrix: BLOSUM62 

Open Gap: 11 and Extension Gap: 1 penalty 

2 5 Gap x drop-off: 50 

Expect: 10 
Word Size: 3 
Filter: on 

Percent identity may be measured over the length of an entire defined polypeptide sequence, for 

3 o example, as defined by a particular SEQ ID number, or may be measured over a shorter length, for 

example* over the length of a fragment taken from a larger, defined polypeptide sequence* for instance, 
a fragment of at least 15, at least 20, at least 30, at least 40, at least 50, at least 70 or at least 150 
contiguous residues. Such lengths are exemplary only, and it is understood that any fragment length 



122 



BNSDOCID: <WO 0162927A2J_> 



WO 01/62927 



PCT7US01/06059 



supported by the sequences shown herein, in figures or Sequence Listings, may be used to describe a 
length ova: which percentage identity may be measured. 

' "Post-translational modification" of a DITHP may involve lipidation, glycosylation, 
phosphorylation, acetylation, racemization, proteolytic cleavage, and other modifications known in the 
5 art. These processes may occur synthetically or biochemically. Biochemical modifications will vary by 
cell type depending on the enzymatic milieu and the DITHP. 

"Probe" refers to dithp or fragments thereof, which are used to detect identical, allelic or related 
nucleic acid sequences. Probes are isolated oligonucleotides or polynucleotides attached to a detectable 
label or reporter molecule. Typical labels include radioactive isotopes, ligands, chemiluminescent 
10 agents, and enzymes. 'Trimers" are short nucleic acids, usually DNA oligonucleotides, which may be 
annealed to a target polynucleotide by complementary base-pairing. The primer may then be extended 
along the target DNA strand by a DNA polymerase enzyme. Prima" pairs can be used for amplification 
(and identification) of a nucleic acid sequence, e.g., by the polymerase chain reaction (PCR). 

1 Probes and primers as used in the present invention typically comprise at least 15 contiguous 
15 nucleotides of a known sequence. In order to enhance specificity, longer probes and primers may also 

be employed, such as probes and primers that comprise at least 20, 30, 40, 50, 60, 70, 80, 90, 100, or 
at least 150 consecutive nucleotides of the disclosed nucleic acid sequences. Probes and primers may 

i 

be considerably longer than these examples, and it is understood that any length supported by the 1 
specification, including the figures and Sequence Listing, may be used. 

2 o Methods for preparing and using probes and primers are described in the references, for 

example Sambrook et al., 1989, Molecular Cloning: A Laboratory Manual , 2 nd ed, vol. 1-3, Cold 
Spring Harbor Press, Plainview NY; Ausubel et al., 1987, Current Protocols in Molecular Biology , 
Greene Publ. Assoc. & Wfley-Intersciences, New York NY; Innis et al., 1990, PCR Protocols* A Guide 
to Methods and Applications , Academic Press, San Diego CA PCR primer pairs can be derived from 
25 a known sequence, for example, by using computer programs intended for that purpose such as Primer 
(Version 0.5, 1991, Whitehead Institute for Biomedical Research, Cambridge MA). 

Oligonucleotides for use as primers are selected using software known in the art for such 
purpose. For example, OLIGO 4.06 software is useful for the selection of PCR prima- pairs of up to 
100 nucleotides each, and for the analysis of oligonucleotides and larger polynucleotides of up to 5,000 

3 o nucleotides from an input polynucleotide sequence of up to 32 kilobases. Similar primer selection 

programs have incorporated additional features for expanded capabilities. For example, the PrimOU 
prima- selection program (available to the public from the Genome Center at University of Texas South 
West Medical Center, Dallas TX) is capable of choosing specific primers from megabase sequences 
and is thus useful for designing primers on a genome- wide scope. The Primer3 primer selection 
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program (available to the public from the Whitehead Institute/MIT Center for Genome Research, 
Cambridge MA) allows the user to input a "mispriming library/' in which sequences to avoid as primer 
binding sites are user-specified Primer3 is usefiil, in particular, for the selection of oligonucleotides for 
microarrays. (The source code for the latter two primer selection programs may also be obtained from 
5 their respective sources and modified to meet the user's specific needs.) The PrimeGen program 
(available to the public from the UK Human Genome Mapping Project Resource Centre, Cambridge 
UK) designs primers based on multiple sequence alignments, thereby allowing selection of primers that 
hybridize to either the most conserved or least conserved regions of aligned nucleic acid sequences. 
Hence, this program is useful for identification of both unique and conserved oligonucleotides and 
10 polynucleotide fragments. The oligonucleotides and polynucleotide fragments identified! by any of the 
above selection methods are useful in hybridization technologies, for example, as PCR or sequencing 
primers, microarray elements, or specific probes to identify fully or partially complementary 

polynucleotides in a sample of nucleic acids. Methods of oligonucleotide selection are not limited to 

1 i 

those described above. 

15 "Purified" refers to molecules, either polynucleotides or polypeptides that are isolated or 

separated from their natural environment and are at least 60% free, preferably at least 75% free, and 
most preferably at least 90% free from other compounds with which they are naturally associated. 

A "recombinant nucleic acid" is a sequence that is not naturally occurring or has a sequence 
that is made by an artificial combination of two or more otherwise separated segments of sequence. 

2 o This artificial combination is often accomplished by chemical synthesis or, more commonly, by the 
artificial manipulation of isolated segments of nucleic acids, e.g., by genetic engineering techniques 
such as those described in Sambrook, supra . The tern recombinant includes nucleic acids that have 
been altered solely by addition, substitution, or deletion of a portion of the nucleic acid. Frequently, a 
recombinant nucleic acid may include a nucleic acid sequence operably linked to a promoter sequence. 

2 5 Such a recombinant nucleic acid may be part of a vector that is used, for example, to transform a cell. 

Alternatively, such recombinant nucleic acids may be part of a viral vector, e.g., based on a 
vaccinia virus, that could be use to vaccinate a mammal wherein the recombinant nucleic acid is 
expressed, inducing a protective immunological response in the mammal. 

"Regulatory element" refers to a nucleic acid sequence from nontranslated regions of a gene, 

3 o and includes enhancers, promoters, introns, and 3' untranslated regions, which interact with host 

proteins to carry out or regulate transcription or translation. 

"Reporter" molecules are chemical or biochemical moieties used for labeling a nucleic acid, an 
amino acid, or an antibody. They include radionuclides; enzymes; fluorescent, chemiluminescent, or 
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chromogenic agents; substrates; cofactors; inhibitors; magnetic particles; and other moieties known in 
the art. 

An "RtyA equivalent," in reference to a DNA sequence, is composed of the same linear 
sequence of nucleotides as the reference DNA sequence with the exception that all occurrences of the 
5 nitrogenous base thymine are replaced with uracil, and the sugar backbone is composed, of ribose 
instead of deoxyribose. 

"Sample" is used in its broadest sense. Samples may contain nucleic or amino acids, 
antibodies, or other materials, and may be derived from any source (e.g„ bodily fluids including, but not 
limited to, saliva, blood, and urine; chromosome(s), organelles, or membranes isolated from a cell; 
10 genomic DNA, RNA, or cDNA in solution or bound to a substrate; and cleared cells or tissues or blots 
or imprints from such cells or tissues). 

"Specific binding" or "specifically binding" refers to the interaction between a protean or 
peptide and its agonist, antibody, antagonist, or other binding partner. The interaction is dependent 
upon the presence of a particular structure of the protein, e.g., the antigenic determinant or epitope, 
15 recognized by the binding molecule. For example, if an antibody is specific for epitope "A," the 
presence of a polypeptide containing epitope A, or the presence of free unlabeled A, in a reaction 
containing free labeled A and the antibody will reduce the amount of labeled A that binds to the 
antibody. 

"Substitution" refers to the replacement of at least one nucleotide or amino acid by a different 

2 o nucleotide or amino add. 

"Substrate" refers to any suitable rigid or semi-rigid support including, e.g., membranes, filters, 
chips, slides, wafers, fibers, magnetic or nonmagnetic beads, gels, tubing, plates, polymers, 
microparticles or capillaries. The substrate can have a variety of surface forms, such as wells, 
trenches, pins, channels and pores, to which polynucleotides or polypeptides are bound. 
25 A 'transcript image" refers to the collective pattern of gene expression by a particular tissue or 

cell type under given conditions at a given time. 

"Transformation" refers to a process by which exogenous DNA enters a recipient cdl. 
Transformation may occur under natural or artificial conditions using various methods well known in 
the art Transformation may rely on any known method for the insertion of foreign nucleic acid 

3 o sequences into a prokaryotic or eukaryotic host cell. The method is selected based on the host cell being 

transformed. 

"Transformants" include stably transformed cells in which the inserted DNA is capable of 
replication either as an autonomously replicating plasmid or as part of the host chromosome, as well as 
cells which transiently express inserted DNA or RNA. 
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A "transgenic organism," as used herein, is any organism, including but not limited to animals 

and plants, in which one or more of the cells of the organism contains heterologous nucleic add 

i • 

introduced by way of human intervention, such as by transgenic techniques well known in the art The 
nucleic acid is introduced into the cell, directly or indirectly by introduction into a precursor of the cell, 
5 by way of deliberate genetic manipulation, such as by microinjection or by infection with a recombinant 
virus. The tern genetic manipulation does not include classical cross-breeding, or in vitro fertilization, 
i but rather is directed to the introduction of a recombinant DNA molecule. The transgenic organisms 

contemplated in accordance with the present invention include bacteria, cyanobacteria, fungi, and plants 
and animals. The isolated DNA of the present invention can be introduced into the host by methods 
10 known in the art, for example infection, transfection, transformation or transconjugation. Techniques 
for transferring the DNA of the present invention into such organisms are widely known and provided in 
references such as Sambrook et al. (1989), supra . 

A "variant* ' of a particular nucleic acid sequence is defined as a nucleic acid sequence having at 
least 25% sequence identity to the particular nucleic acid sequence ova: a certain length of one of the 

1 is nucleic acid sequences using blastn with the "BLAST 2 Sequences" tool Version 2.0.9 (May-07-1999) 

set at default parameters. Such a pair of nucleic acids may show, for example, at least 30%, at least 
50%, at least 60%, at least 70%, at least 80%, at least 90%, at least 95% or even at least 98% or 
i greater sequence identity over a certain defined length. The variant may result in "conservative" amino , 

acid changes which do not affect structural and/or chemical properties. A variant may be described as, 
20 for example, an "allelic" (as defined above), "splice," "species," or "polymorphic" variant A splice 
variant may have significant identity to a reference molecule, but wiD generally have a greater or lesser 
number of polynucleotides due to alternate splicing of exons during mRNA processing. The 
corresponding polypeptide may possess additional functional domains or lack domains that are present 

in the reference molecule. Species variants are polynucleotide sequences that vary from one species to 

i 

2 5 another. The resulting polypeptides generally will have significant amino acid identity relative to each 

other. A polymorphic variant is a variation in the polynucleotide sequence of a particular gene between 
individuals of a given species. Polymorphic variants also may encompass "single nucleotide 
polymorphisms" (SNPs) in which the polynucleotide sequence varies by one base. The presence of 
SNPs may be indicative of, for example, a certain population, a disease state, or a propensity for a 
30 disease state. 

In an alternative, variants of the polynucleotides of the present invention may be generated 
through recombinant methods. One possible method is a DNA shuffling technique such as 
MOLECUL ARB REEDING (Maxygen Inc., Santa Clara CA; described in U.S. Patent Number 
5,837,458; Chang, C.-C et al. (1999) Nat Biotechnol. 17:793-797; Christians, F.C et al. (1999) Nat 
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Biotechnol. 17:259-264; and Crameri, A. et al. (1996) Nat Biotechnol. 14:315-319) to alto: or improve 
the biological properties of DITHP, such as its biological or enzymatic activity or its ability to bind to 
other molecules or compounds. DNA shuffling is a process by which a library of gene variants is 
produced using PCR-mediated recombination of gene fragments. The library is then subjected to 
5 selection or screening procedures that identify those gene variants with the desired properties. These 
preferred variants may then be pooled and further subjected to recursive rounds of DNA shuffling and 
selection/screening. Thus, genetic diversity is created through "artificial" breeding and rapid molecular 
evolution. For example, fragments of a single gene containing random point mutations may be 

recombined, screened, and then reshuffled until the desired properties are optimized. Alternatively, 

, ■ i 

1 o fragments of a given gene may be recombined with fragments of homologous genes in the same gene 

family, either from the same or different species, thereby maximizing the genetic diversity of multiple 
naturally occurring genes in a directed and controllable manner. 

A "variant" of a particular polypeptide sequence is defined as a polypeptide sequence having 
at least 40% sequence identity to the particular polypeptide sequence over a certain length of one of 
15 the polypeptide sequences using blastp with the "BLAST 2 Sequences" tool Version 2.0.9 (May-07- 
1999) set at default parameters. Such a pair of polypeptides may show, for example, at least 50%, at 
least 60%, at least 70%, at least 80%, at least 90%, at least 95%, or at least 98% or greater sequence 
identity over a certain defined length of one of the polypeptides. 

2 0 THE INVENTION 

In a particular embodiment, cDNA sequences derived from human tissues and cell lines were 
aligned based on nucleotide sequence identity and assembled into "consensus" or "template" sequences 
which are designated by the template identification numbers (template IDs) in column 2 of Table 1 . 
The sequence identification numbers (SEQ ID NO:s) corresponding to the template IDs are shown in 

2 5 column 1 . The template sequences have similarity to GenBank sequences, or "hits," as designated by 

the GI Numbers in column 3. The statistical probability of each GenBank hit is indicated by a 
probability score in column 4, and the functional annotation corresponding to each GenBank hit is listed 
in column 5. 

The invention incorporates the nucleic acid sequences of these templates as disclosed in the 

3 o Sequence Listing and the use of these sequences in the diagnosis and treatment of disease states 

characterized by defects in human molecules. The invention further utilizes these sequences in 
hybridization and amplification technologies, and in particular, in technologies which assess gene 
expression patterns correlated with specific cells or tissues and their responses in vivo or in vitro to 
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pharmaceutical agents, toxins, and other treatments. In this manna:, the sequences of the presort 
invention are used to develop a transcript image for a particular cell or tissue. 

Derivation of Nucleic Acid Sequences 
5 cDN A was isolated from libraries constructed using RN A derived from normal and diseased 

human tissues and cell lines. The human tissues and cell lines used for cDNA library construction were 
selected from a broad range of sources to provide a diverse population of cDNAs representative of gene 
transcription throughout the human body. Descriptions of the human tissues and cell lines used far 
cDNA library construction are provided in the LIFESEQ database (Incyte Genomics, Inc. (Incyte), Palo 

1 o Alto CA). Human tissues were broadly selected from, for example, cardiovascular, dermatologic, 

endocrine, gastrointestinal, hematopoietic/immune system, musculoskeletal, neural, reproductive, and 
urologic sources. 

Cell lines used for cDNA library construction ware derived from, for example, leukemic cells, 
teratocarcinomas, neuroepitheliomas, cervical carcinoma, lung fibroblasts, and endothelial ceils. Such 
as cell lines include, for example, THP-1, Jurkat, HUVEC, hNT2, WI38, HeLa, and other cell lines 
commonly used and available from public depositories (American Type Culture Collection, Manassas 
VA). Prior to mRNA isolation, cell lines were untreated, treated with a pharmaceutical agent such as 
5 -aza~2 -deoxycytidine, treated with an activating agent such as lipopolysaccharide in the case of 
leukocytic cell lines, or, in the case of endothelial cell lines, subjected to shear stress. 

20 

Sequencing of the cDNAs 

Methods for DNA sequencing are well known in the art Conventional enzymatic methods 
employ the Klenow fragment of DNA polymerase I, SEQUENASE DNA polymerase (U.S. 

Biochemical Corporation, Cleveland OH), Taq polymerase (Applied Biosystems, Foster City CA), 

i 

2 5 thermostable T7 polymerase (Amersham Pharmacia Biotech, Inc. (Amersham Pharmacia Biotech), 

Piscataway NJ), or combinations of polymerases and proofreading exonucleases such as those found in 
theELONGASE amplification system (Life Technologies Inc. (Life Technologies), Gaithersburg MD), 
to extend the nucleic acid sequence from an oligonucleotide primer annealed to the DNA template of 
interest Methods have been developed for the use of both single-stranded and double-stranded 

3 o templates. Chain termination reaction products may be electrophoresed on urea-polyacrylamide gels 

and detected either by autoradiography (for radioisotope-labeled nucleotides) or by fluorescence (for 
fluorophore-labeled nucleotides). Automated methods for mechanized reaction preparation, sequencing, 
and analysis using fluorescence detection methods have been developed. Machines used to prepare 
cDNAs for sequencing can include the MICROLAB 2200 liquid transfer system (Hamilton Company 
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(Hamilton), Reno NV), Peltier thermal cycler (PTC200; MJ Research, Inc. (MJ Research), Watertown 
MA), and ABI CATALYST 800 thermal cycler (Applied Biosystems). Sequencing can be carried out 
using, for example, the ABI 373 or 377 (Applied Biosystems) or MEGABACE 1000 (Molecular 
Dynamics, Inc. (Molecular Dynamics), Sunnyvale CA) DNA sequencing systems, or other automated 
5 and manual sequencing systems well known in the art 

The nucleotide sequences of the Sequence Listing have been prepared by current, state-of-the- 
art, automated methods and, as such, may contain occasional sequencing errors or unidentified 
nucleotides. Such unidentified nucleotides are designated by an N. These infrequent unidentified bases 
do not represent a hindrance to practicing the invention for those skilled in the art Several methods 
l o employing standard recombinant techniques may be used to correct errors and complete the missing 
sequence information. (See, e.g„ those described in AusubeJ, F.M. et al. (1997) Short Protocols in 
Molecular Biology . John Wiley & Sons, New York NY; and Sambrook, J. et al. (1989) Molecular 
Cloning. A Laboratory Manual Cold Spring Harbor Press, Plainview NY.) 

i 

15 Assembly of cDNA Sequences 

Human polynucleotide sequences may be assembled using programs or algorithms well known 
in the art Sequences to be assembled are related, wholly or in part, and may be derived from a single 
or many different transcripts. Assembly of the sequences can be performed using such programs as 
PHRAP (Phils Revised Assembly Program) and the GELVIEW fragment assembly system (GCG), or 
20 other methods known in the art. 

Alternatively, cDNA sequences are used as "component" sequences that are assembled into 
"template" or "consensus" sequences as follows. Sequence chromatograms are processed, verified, and 

quality scores are obtained using PHRED. Raw sequences are edited using an editing pathway known 

i 

as Block 1 (See, e.g., the LIFESEQ Assembled User Guide, Incyte Genomics, Palo Alto, CA). A series 
25 of BLAST comparisons is pa-formed and low-information segments and repetitive elements (e.g., 

dinucleotide repeats, Alu repeats, etc.) are replaced by "n's", or masked, to prevent spurious matches. 
Mitochondrial and ribosomal RNA sequences are also removed The processed sequences are then 
loaded into a relational database management system (RDMS) which assigns edited sequences to 
existing templates, if available. When additional sequences are added into the RDMS, a process is 
3 o initiated which modifies existing templates or creates new templates from works in progress (i.e., 

nonfinal assembled sequences) containing queued sequences or the sequences themselves. After the new 
sequences have been assigned to templates, the templates can be merged into bins. If multiple templates 
exist in one bin, the bin can be split and the templates reannotated. 

129 



BNSOOCID: <WQ 0162927A2_I_> 



WO 01/62927 



PCT/US01/06059 



Once gene bins have been generated based upon sequence alignments,' bins are "clone joined" 
based upon clone information. Clone joining occurs when the 5' sequence of one clone is present in one 
bin and the 3' sequence from the same clone is present in a different bin, indicating that the two bins 
should be merged into a single bin. Only bins which share at least two different clones are merged 
5 A resultant template sequence may contain either a partial or a full length open reading frame, 

or all or part of a genetic regulatory element This variation is due in part to the fact that the full length 
cDNAs of many genes are several hundred, and sometimes several thousand, bases in length. With 
current technology, cDNAs comprising the coding regions of large genes cannot be cloned because of 

vector limitations, incomplete reverse transcription of the mRNA, or incomplete "second strand" 

i 

10 synthesis. Template sequences may be extended to include additional contiguous sequences derived 
from the parent RNA transcript using a variety of methods known to those of skill in the art. Extension 
may thus be used to aphieve the full length coding sequence of a gene* 

Analysis of the cDNA Sequences 

15 The cDNA sequences are analyzed using a variety of programs and algorithms which are well 

known in the art. (See, e.g., Ausubel, 1997, supra , Chapter 7.7; Meyers, R. A (Ed.) (1995) Molecular 
Biology and Biotechnology . Wiley VCH, New York NY, pp. 856-853; and Table 7.) These analyses 
comprise both reading frame determinations, e.g. , based on triplet codon periodicity for particular 
organisms (Fickett; J.W. (1982) Nucleic Acids Res. 10:5303-5318); analyses of potential start and stop 

2 o codons; and homology searches. 

Computer programs known to those of skin in the art for performing computer-assisted 
searches for amino acid and nucleic acid sequence similarity, include, for example, Basic Local 
Alignment Search Tool (BLAST; Altschul, S.F. (1993) J. Mol. Evol. 36:290-300; Altschul, S.F. et al. 
(1990) J. Mol. Biol. 215:403-410). BLAST is especially useful in determining exact matches and 

25 comparing two sequence fragments of arbitrary but equal lengths, whose alignment is locally maximal 
and for which the alignment score meets or exceeds a threshold or cutoff score set by the user (Karlin, 
S. et aL (1988) Proc. NatL Acad. Sci. USA 85:841-845). Using an appropriate search tool (e.g., 
BLAST or HMM), GenBank, SwissProt, BLOCKS, PFAM and other databases may be searched for 
sequences containing regions of homology to a query dithp or DITHP of the present invention. 

30 Other approaches to the identification, assembly, storage, and display of nucleotide and 

polypeptide sequences are provided in "Relational Database for Storing Biomolecule Information," 
U.S.S.N. 08/947,845, filed October 9, 1997; "Project-Based Full-Length Biomolecular Sequence 
Database," U.S.S.N. 08/81 1,758, filed March 6, 1997; and "Relational Database and System for 
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Storing Information Relating to Biomolecular Sequences/' U.S.S.N. 09/034,807, filed March 4, 1998, 
all of which are incorporated by reference herein in their entirety. 

Protein bier archies can be assigned to the putative encoded polypeptide based on, e.g„ motif, 
BLAST, or biological analysis. Methods for assigning these hierarchies are described, for example, in 
5 "Database System Employing Protein Function Hierarchies for Viewing Biomolecular Sequence Data," 
U.S.S.R 08/812,290, filed March 6, 1997, incorporated herein by reference. 

Identification of Human Diagnostic and Therapeutic Molecules Encoded bv dithp 

The identities of the DITHP encoded by the dithp of the present invention were obtained by 

10 analysis of the assembled cDN A sequences. 

SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, 
SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID 
NO:222, and SEQ ID NO:223, encoded by SEQ ID NO:l, SEQ ID NO:2, SEQ ID NO:3, SEQ ID 
NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO:10, 

15 SEQ ID NO:ll, and SEQ ID NO:12, respectively, are, for example, human enzyme molecules. 

SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, and SEQ ID NO:228, 
encoded by SEQ ID NO:13, SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16, and SEQ ID NO:17, 
respectively, are, for example, receptor molecules. 

SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID NO:233, 

2 o SEQ ID NO:234, SEQ ID NO:235 , SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID 
NO:239, SEQ ID NO:240, and SEQ ID NO:241, encoded by SEQ ID NO: 18, SEQ ID NO: 19, SEQ 
ID NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:25, SEQ 
ID NO:26, SEQ ID NO;27, SEQ ID NO:28, SEQ ID NO:29, and SEQ IDNO:30, respectively, are, 
for example, intracellular signaling molecules. 

25 SEQ ID NO:242, SEQ ID NO:243, SEQ ID NO:244, SEQ ID NO:245, SEQ ID NO:246, 

SEQ ID NO:247, SEQ ID NO:248, SEQ ID NO:249, SEQ ID NO:250, SEQ ID NO:251, SEQ ID 
NO:252, SEQ ID NO:253, SEQ ID NO:254, SEQ ID NO:255, SEQ ID NO:256, SEQ ID NO:257, 
SEQ ID NO:258, SEQ ID NO:259, SEQ ID NO:260, SEQ ID NO:261, SEQ ID NO:262, SEQ ID 
NO:263, SEQ ID NO:264, SEQ ID NO:265, SEQ ID NO:266, SEQ ID NO:267, SEQ ED NO:268, 

30 SEQ ID NO:269, SEQ ID NO:270, SEQ ID NO:271, SEQ ID NO:272, SEQ ID NO:273, SEQ ID 
NO:274, SEQ ID NO:275, SEQ ID NO:276, SEQ ID NO:277, SEQ ID NO:278, SEQ ID NO:279, 
SEQ ID NO:280, SEQ ID NO:281, SEQ ID NO:282, SEQ ID NO:283, SEQ ID NO:284, SEQ ID 
NO:285, SEQ ID NO:286, SEQ ID NO:287, SEQ ID NO:288, SEQ ID NO:289, SEQ ID NO:290, 
SEQ ID NO:291, SEQ ID NO:292, SEQ ID NO:293, SEQ ID NO:294, SEQ ID NO:295, SEQ ID 
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NO:296, SEQ ID NO:297, SEQ ID NO:298, SEQ ID NO:299, SEQ ID NO:300, SEQ ID NO:301, 
SEQ ID NO:302, SEQ ID NO:303, SEQ ID NO:304, SEQ ID NO:305, SEQ ID NO:306, SEQ ID 
NO:307, SEQ ID NO:308, and SEQ ID NO:309, encoded by SEQ ID N0:31, SEQ ID NO:32, SEQ 
ID NO:33, SEQ ID NO:34, SEQ ID NO:35, SEQ ID NO:36, SEQ ID NO:37, SEQ ID NO:38, SEQ 
5 ID NO:39, SEQ ID NO:40, SEQ ID N0:41, SEQ ID NO:42, SEQ ID NO:43, SEQ ID NO:44, SEQ 
ID NO:45, SEQ ID NO:46, SEQ ID NO:47, SEQ ID NO:48, SEQ ID NO:49, SEQ ED NO:50, SEQ 
ID N0:51, SEQ ID NO:52, SEQ ID NO:53, SEQ ID NO:54, SEQ ID NO:55, SEQ ID NO:56, SEQ 
ID NO:57, SEQ ID NO:58, SEQ ID NO:59, SEQ ID NO:60, SEQ ID N0:61, SEQ ID NO:62, SEQ 
ID NO:63, SEQ ID NO:64, SEQ ID NO:65, SEQ ID NO:66, SEQ ID NO:67, SEQ ID NO:68, SEQ 

10 ID NO:69, SEQ ID NO:70, SEQ ID N0:71, SEQ ID NO:72, SEQ ID NO:73, SEQ ID NO:74, SEQ 
ED NO:75. SEQ ID NO:76, SEQ ID NO:77, SEQ ID NO:78. SEQ ID NO:79, SEQ ID NO:80, SEQ 
ID N0:81, SEQ ID NO:82, SEQ ID NO:83, SEQ ID NO:84, SEQ ID NO:85, SEQ ID NO:86, SEQ 
ID NO:87, SEQ ID NO:88, SEQ ID NO:89, SEQ ID NO:90, SEQ ID N0:91, SEQ ID NO:92, SEQ 
ID NO:93, SEQ ED N0.94, SEQ ID NO:95, SEQ ED NO:96, SEQ ID NO:97, and SEQ ED NO:98, 

1 1 5 respectively, are, for example, transcription factor molecules. 

SEQ ED NO:310, SEQ ED NO:31 1, SEQ ED NO:312, SEQ ED NO:313, SEQ ED NO:314, 
SEQ ED NO:315, SEQ ED NO:316, and SEQ ID NO:317i encoded by SEQ ID NO:99, SEQ ED 
NO:100, SEQ ID NO:101, SEQ ED NO:102, SEQ ED NO:103, SEQ ED NO:104, SEQ ED NO:105, 
and SEQ ED NO:106, respectively, are, for example, membrane transport molecules. 

20 SEQ ID NO:318, SEQ ED NO:319, SEQ ED NO:320, SEQ ED NO:321, SEQ ID NO:322, 

SEQ ED NO.-323, SEQ ED NO:324, SEQ ED NO:325, SEQ ID NO:326, and SEQ ED NO:327, encoded 
bySEQIDNO:107, SEQEDNO:108, SEQEDNO:109, SEQEDNO:110, SEQEDNO:lll, SEQ ID 
NO:l 12, SEQ ID NO:l 13, SEQ ID NO:l 14, SEQ ED NO:l 15, and SEQ ED NO.l 16, respectively, are, 
for example, protein modification and maintenance molecules. 

25 SEQ ID NO:328, SEQ ID NO:329, SEQ ED NO:330, SEQ ID NO:331, SEQ ED NO:332, 

SEQ ED NO:333, SEQ ED N0.334, SEQ ID NO:335, SEQ ED N0.336, SEQ ED NO:337, SEQ ED 
NO:338, SEQ ED NO:339, SEQ ED NO:340, and SEQ ID NO:341, encoded by SEQ ID NO:117, SEQ 
ED NO.118, SEQ ID N0.119, SEQ ED NO.120, SEQ ED N0.121, SEQ EDNO:122, SEQ ED NO:123, 
SEQ ED N0.124, SEQ ID NO:125, SEQ ED N0.126, SEQ ED NO:127, SEQ ED NO:128, SEQ ED 

30 NO:129, and SEQ ED NO:130, respectively, are, for example, nucleic acid synthesis and modification 
molecules. 

SEQ ID NO:342, encoded by SEQ ED NO:131 is, for example, an adhesion molecule. 
SEQ ED NO:343, SEQ ED NO:344, SEQ ED NO:345, SEQ ED NO:346, SEQ ED N0.347, 
SEQ ED NO:348, and SEQ ID NO:349, encoded by SEQ ED NO:132, SEQ ED NO:133, SEQ ED 
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NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ ID NO:137, and SEQ ID NO:138, respectively, are, 
for example, antigen recognition molecules. ' 

SEQ ID NO:350, SEQ ID NO:35 1 , SEQ ID NO:352, and SEQ ID NO:353, encoded by SEQ 
ID NO:139, SEQ ID NO:140, SEQ ID NO:141, and SEQ ID NO:142, respectively, are, for example, 
5 electron transfer associated molecules. 

SEQ ID NO:354, SEQ ID NO:355, SEQ ID NO:356, SEQ ID NO:357, SEQ ID NO:358, and 
SEQ ID NO:359, encoded by SEQ ID NO:143, SEQ ID NO.144, SEQ ID NO: 145, SEQ ID NO:146, 
SEQ ID NO: 147, and SEQ ID NO: 148, respectively, are, for example, secreted/extracellular matrix 
molecules. 

10 SEQIDNO:360,SEQIDNO:361,SEQIDNO:362,SEQIDNO:363,SEQIDNO:364, 

SEQ ID NO:365, SEQ ID NO:366, SEQ ID NO:367, SEQ ID NO:368, and SEQ ID NO:369, encoded 
by SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151, SEQ ID NO:152, SEQ ID NO:153, SEQ ID 
NO:154, SEQ ID NO:155, SEQ ID NO:156, SEQ ED NO:157, and SEQ ID NO:158, respectively, 

, are, for example, cytoskeletal molecules. 

15 SEQ ID NO.-370, SEQ ID NO:371, SEQ ID NO:372, and SEQ ID NO:373, encoded by SEQ 

ID NO:159, SEQ ID NO:160, SEQ ID NO:161, and SEQ ID NO:162, respectively, are, for example, 
cell membrane molecules. 

SEQ ID NO:374, SEQ ID NO:375, SEQ ID NO:376, SEQ ID NO:377, SEQ ID NO:378, 
SEQ ID NO:379, SEQ ID NO:380, SEQ ID NO:381> SEQ ID NO:382, SEQ ID NO:383, SEQ ID 

20 NO:384, SEQ ID NO:385, SEQ ID NO:386, SEQ ID NO:387, SEQ ID NO:388, SEQ ID NO:389, 
SEQ ID NO:390, SEQ ID NO:391, and SEQ ID NO:392, encoded by SEQ ID NO: 163, SEQ ID 
NO:164, SEQ ID NO:165, SEQ BO NO:166, SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO.169, 
SEQ ID NO:170, SEQ ID NO:171, SEQ ID NO.172, SEQ ID NO:173, SEQ ID NO.174, SEQ ID 
NO:175, SEQ ID NO:176, SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, 

2 5 and SEQ ID NO: 181, respectively, are, for example, ribosomal molecules. 

SEQ ID NO:393, SEQ ID NO:394, SEQ ID NO:395, SEQ ID NO:396, SEQ ID NO:397, 
SEQ ID NO:398, SEQ ID NO:399, SEQ ID NO:400, SEQ ED NO:401, SEQ ID NO:402, and SEQ ED 
NO:403, encoded by SEQ ED NO:182, SEQ ID NO:183, SEQ ED NO:184, SEQ ID NO:185, SEQ ED 
NO:186, SEQ ED NO.187, SEQ ED NO:188, SEQ ED NO:189, SEQ ID NO:190, SEQ ID NO:191, 

3 o and SEQ ED NO: 1 92, respectively, are, for example, organelle associated molecules. 

SEQ ED NO:404, SEQ ED NO:405, SEQ ID NO:406, SEQ ED NO:407, SEQ ED NO:408, 
SEQ ID NO:409, SEQ ED NO:410, SEQ ID NO:411, SEQ ED NO:412, SEQ ED NO:413, and SEQ ED 
NO:414, encoded by SEQ ED NO:193, SEQ ID NO:194, SEQ ED NO:l95, SEQ ED NO:196, SEQ ID 
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NO:197, SEQ ID NO:198, SEQ ID NO:I99, SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, 
and SEQ ID NO:203, respectively, are,- for example, biochemical pathway molecules. 

SEQ ID NO:415, SEQ ID NO:416, SEQ ID NO:417, SEQ ID NO:418, SEQ ID NO:419, 
SEQ ID NO:420, SEQ ID NO:421, and SEQ ID NO:422, encoded by SEQ ID NO:204, SEQ ID 
5 NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, 
and SEQ ID NO:21 1 , respectively, are, for example, molecules, associated with growth and 
development 

i « 

Se quences of Human Diagnostic and Therapeutic Molecules 

10 The dithp of the present invention may be used for a variety of diagnostic and therapeutic 

purposes. For example, a ditbp may be used to diagnose a particular condition, disease, or disorder 
associated with human molecules. Such conditions, diseases, and disorders include, but are not limited 
to, a cell proliferative disorder, such as actinic keratosis, arteriosclerosis, atherosclerosis, bursitis, 
cirrhosis, hepatitis, mixed connective tissue disease (MCTD), myelofibrosis, paroxysmal nocturnal 

15 hemoglobinuria, polycythemia vera, psoriasis, primary thrombocythemia, and cancers including 
adenocarcinoma, leukemia, lymphoma, melanoma, myeloma, sarcoma, teratocarcinoma, and, in 
particular, a cancer of the adrenal gland, bladder, bone, bone marrow, brain, breast, cervix, gall 
bladder, ganglia, gastrointestinal tract, heart, kidney, liver, lung, muscle, ovary, pancreas, parathyroid, 
penis, prostate, saljvaiy glands, skin, spleen, testis, thymus, thyroid, and uterus; an 

2 o autoimmune/inflammatory disorder, such as inflammation, actinic keratosis, acquired 

immunodeficiency syndrome (AIDS), Addison's disease, adult respiratory distress syndrome, . 
allergies, ankylosing spondylitis, amyloidosis, anemia, arteriosclerosis, asthma, atherosclerosis, 
autoimmune hemolytic anemia, autoimmune thyroiditis, bronchitis, bursitis, cholecystitis, cirrhosis, 
contact dermatitis, Crohn's disease, atopic dermatitis, dermatomyositis, diabetes mellitus, 

2 5 emphysema, erythroblastosis fetalis, erythema nodosum, atrophic gastritis, glomerulonephritis, 

Goodpasture's syndrome, gout, Graves' disease, Hashimoto's thyroiditis, paroxysmal nocturnal 
hemoglobinuria, hepatitis, hypereosinophilia, irritable bowel syndrome, episodic lymphopenia with 
lymphocytotoxins, mixed connective tissue disease (MCTD), multiple sclerosis, myasthenia gravis, 
myocardial or pericardial inflammation, myelofibrosis, osteoarthritis, osteoporosis, pancreatitis, 

3 o polycythemia vera, polymyositis, psoriasis, Reiter's syndrome, rheumatoid arthritis, scleroderma, 

Sjogren's syndrome, systemic anaphylaxis, systemic lupus erythematosus, systemic sclerosis, primary 
thrombocythemia, thrombocytopenic purpura, ulcerative colitis, uveitis, Werner syndrome, 
complications of cancer, hemodialysis, and extracorporeal circulation, trauma, and hematopoietic 
cancer including lymphoma, leukemia, and myeloma; an infection caused by a viral agent classified 
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as adenovirus, arenavirus, bunyavirus, calicivirus, coronavirus, filovirus, hepadnavirus, herpesvirus, 
flavivirus, orthomyxovirus, parvovirus, papbvavirus, paramyxovirus, picornavirus, poxvirus, reovirus, 
retrovirus, rhabdovirus, or togavirus; an infection caused by a bacterial agent classified as 
pneumococcus, staphylococcus, streptococcus, bacillus, corynebacterium, Clostridium, 
5 meningococcus, gonococcus, listeria, moraxella, kingella, haemophilus, legionella, bordetella, gram- 
negative enterobacterium including shigella, salmonella, or Campylobacter, pseudomonas, vibrio, 
brucella, francisella, yersinia, bartonelia, norcardium, actinomyces, mycobacteriiim, spirochaetale, 
rickettsia, chlamydia, or mycoplasma; an infection caused by a fungal agent classified as aspergillus, 
blastomyces, dermatophytes, cryptococcus, coccidioides, malasezzia, histoplasma, or other mycosis- 

10 causing fungal agent; and an infection caused by a parasite classified as Plasmodium or malaria- 
causing, parasitic entamoeba, leishmania, trypanosoma, toxoplasma, Pneumocystis carinii, intestinal 
protozoa such as giardia, trichomonas, tissue nematode such as trichinella, intestinal nematode such 
as ascaris, lymphatic filarial nematode, trematode such as schistosoma, and cestrode such as 

, tapeworm; a developmental disorder such as renal tubular acidosis, anemia, Cushing's syndrome, 

is achondroplastic dwarfism, Duchenne and Becker muscular dystrophy, epilepsy, gonadal dysgenesis, 
WAGR syndrome (Wilms' tumor, aniridia, genitourinary abnormalities, and mental retardation), 
Smith-Magenis syndrome, myelodysplastic syndrome, hereditary mucoepithelial dysplasia, hereditary 
keratodermas, hereditary neuropathies such as Charcot-Marie-Tooth disease and neurofibromatosis, 
hypothyroidism, hydrocephalus, seizure disorders such as Syndenham's chorea and cerebral palsy, 

20 spina bifida, anencephaly, craniorachischisis, congenital glaucoma, cataract, and sensorineural hearing 
loss; an endocrine disorder such as a disorder of the hypothalamus and/or pituitary resulting from 
lesions such as a primary brain tumor, adenoma, infarction associated with pregnancy, 
hypophysectomy, aneurysm, vascular malformation, thrombosis, infection, immunological disorder, and 
complication due to head trauma; a disorder associated with hypopituitarism including hypogonadism, 

25 Sheehan syndrome, diabetes insipidus, Kallman's disease, Hand-Schuller-Christian disease, Letterer- 
Siwe disease, sarcoidosis, empty sella syndrome, and dwarfism; a disorder associated with 
hyperpituitarism including acromegaly, giantism, and syndrome of inappropriate antidiuretic hormone 
(ADH) secretion (SIADH) often caused by benign adenoma; a disorder associated with hypothyroidism 
including goiter, myxedema, acute thyroiditis associated with bacterial infection, subacute thyroiditis 

30 associated with viral infection, autoimmune thyroiditis (Hashimoto's disease), and cretinism; a disorder 
associated with hypothyroidism including thyrotoxicosis and its various forms, Grave's disease, 
pretibial myxedema, toxic multinodular goiter, thyroid carcinoma, and Plummer's disease; a disorder 
associated with hyperparathyroidism including Conn disease (chronic hypercalemia); a pancreatic 
disorder such as Type I or Type II diabetes mellitus and associated complications; a disorder associated 
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with the adrenals such as hyperplasia,, carcinoma, or adenoma of the adrenal cortex, hypotension 
associated with alkalosis, amyloidosis, hypokalemia, Cushing's disease, Liddle's syndrome, and 
Arnold-Healy-Gordon syndrome, pheochromocytoma tumors, and Addison's disease; a disorder 
associated with t gonadal steroid hormones such as: in women, abnormal prolactin production, 
5 infertility, endometriosis, perturbation of the menstrual cycle, polycystic ovarian disease, 

hyperprolactinemia, isolated gonadotropin deficiency, amenorrhea, galactorrhea, hermaphroditism, 
hirsutism and virilization, breast' cancer, and, in postmenopausal women, osteoporosis; and, in men, 
Leydig cell deficiency, male climacteric phase, and germinal cell aplasia, a hypergonadal disorder 
associated with Leydig cell tumors, androgen resistance associated with absence of androgen receptors, 

10 syndrome of 5 a-reductase, and gynecomastia; a metabolic disorder such as Addison's 'disease, 

cerebrotendinous xanthomatosis, congenital adrenal hyperplasia, coumarin resistance, cystic fibrosis, 
diabetes, fatty hepatocirrhosis, fructose- 1,6-diphosphatase deficiency, galactosemia, goiter, 
glucagonoma, glycogen storage diseases, hereditary fructose intolerance, hyperadrenalism, 
hypoadrenalism, hyperparathyroidism, hypoparathyroidism, hypercholesterolemia, hyperthyroidism, 

1 5 hypoglycemia, hypothyroidism, hyperlipidemia, hyperlipemia, lipid myopathies, lipodystrophies, 
lysosomal storage diseases, mannosidosis, neuraminidase deficiency, obesity, pentosuria 
phenylketonuria, pseudovitamin D-deficiency rickets; disorders of carbohydrate metabolism such as 
congenital type II dyserythropoietic anemia, diabetes, insulin-dependent diabetes mellitus, 
non-insulin-dependent diabetes mellitus, fructose- 1 ,6-diphosphatase deficiency, galactosemia, 

20 glucagonoma, hereditary fructose intolerance, hypoglycemia, mannosidosis, neuraminidase 

deficiency, obesity, galactose epimerase deficiency, glycogen storage diseases, lysosomal storage 
diseases, fructosuria, pentosuria, and inherited abnormalities of pyruvate metabolism; disorders of 
lipid metabolism such as fatty liver, cholestasis, primary biliary cirrhosis, carnitine deficiency, 
carnitine palmitoyltransferase deficiency, myoadenylate deaminase deficiency, hypertriglyceridemia, 

25 lipid storage disorders such Fabry's disease, Gaucher^ disease, Niemann-Pick's disease, 
metachromatic leukodystrophy, adrenoleukodystrophy, GIV^ gangliosidosis, and ceroid 
lipofuscinosis, abetalipoproteinemia, Tangier disease, hyperlipoproteinemia, diabetes mellitus, 
lipodystrophy, lipomatoses, acute panniculitis, disseminated fat necrosis, adiposis dolorosa, lipoid 
adrenal hyperplasia, minimal change disease, lipomas, atherosclerosis, hypercholesterolemia, 

3 o hypercholesterolemia with hypertriglyceridemia, primary hypoalphalipoproteinemia, hypothyroidism, 
renal disease, liver disease, letithinrcholesterol acyltransferase deficiency, cerebrotendinous 
xanthomatosis, sitosterolemia, hypocholesterolemia, Tay-Sachs disease, Sandhoffs disease, 
hyperlipidemia, hyperlipemia, lipid myopathies, and obesity; and disorders of copper metabolism 
such as Menke's disease, Wilson's disease, and Ehlers-Danlos syndrome type IX; a neurological 
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disorder such as epilepsy, ischemic cerebrovascular disease, stroke, cerebral neoplasms, Alzheimer's 
disease, Pick's disease, Huntington's disease, dementia, Parkinson's disease and other extrapyramidal 
disorders, amyotrophic lateral sclerosis and other motor neuron disorders, progressive neural muscular 
atrophy, retinitis pigmentosa, hereditary ataxias, multiple sclerosis and other demyelinating diseases, 
5 bacterial and viral meningitis, brain abscess, subdural empyema, epidural abscess, suppurative 
intracranial thrombophlebitis, myelitis and radiculitis, viral central nervous system disease, prion 
diseases including kuru, Creutzfeldt- Jakob disease, and Gerstmann-Straussler-Scheinker syndrome, 
fatal familial insomnia, nutritional and metabolic diseases of the nervous system, neurofibromatosis, 
tuberous sclerosis, cerebelloretinal hemangioblastomatosis, encephalotrigeminal syndrome, mental 

10 retardation and other developmental disorder of the central nervous system, cerebral palsy, a 

neuroskeletal disorder, an autonomic nervous system disorder, a cranial nerve disorder, a spinal cord 
disease, muscular dystrophy and other neuromuscular disorder, a peripheral nervous system disorder, 
dermatomyositis and polymyositis, inherited, metabolic, endocrine, and toxic myopathy, myasthenia 
gravis, periodic paralysis, a mental disorder including mood, anxiety, and schizophrenic disorders, 

15 seasonal affective disorder (SAD), akathesia, amnesia, catatonia, diabetic neuropathy, tardive 
dyskinesia, dystonias, paranoid psychoses, postherpetic neuralgia, and Tourette's disorder; a 
gastrointestinal disorder including ulcerative colitis, gastric and duodenal ulcers, cystinuria, 
dibasicaminoaciduria, hypercystinuria, lysinuria, hartnup disease, tryptophan malabsorption, 
methionine malabsorption, histidinuria, iminoglycinuria, dicarboxylicaminoaciduria, cystinosis, renal 

20 glycosuria, hypouricemia, familial hypophophatemic rickets, congenital chloridonhea, distal renal 
tubular acidosis, Menkes' disease, Wilson's disease, lethal diarrhea, juvenile pernicious anemia, 
folate malabsorption, adrenoleukodystrophy, hereditary myoglobinuria, and Zellweger syndrome; a 
transport disorder such as akinesia, amyotrophic lateral sclerosis, ataxia telangiectasia, cystic fibrosis, 
Becker's muscular dystrophy, Bell's palsy, Charcot-Marie Tooth disease, diabetes mellitus, diabetes 

25 insipidus, diabetic neuropathy, Duchenne muscular dystrophy, hyperkalemic periodic paralysis, 

normokalemic periodic paralysis, Parkinson's disease, malignant hyperthermia, multidrug resistance, 
myasthenia gravis, myotonic dystrophy, catatonia, tardive dyskinesia, dystonias, peripheral 
neuropathy, cerebral neoplasms, prostate cancer, cardiac disorders associated with transport, e.g., 
angina, bradyanythmia, tachyarrythmia, hypertension, Long QT syndrome, myocarditis, 

30 cardiomyopathy, nemaline myopathy, centronuclear myopathy, lipid myopathy, mitochondrial 
myopathy, thyrotoxic myopathy, ethanol myopathy, dermatomyositis, inclusion body myositis, 
infectious myositis, and polymyositis, neurological disorders associated with transport, e.g., 
Alzheimer's disease, amnesia, bipolar disorder, dementia, depression, epilepsy, Tourette's disorder, 
paranoid psychoses, and schizophrenia, and other disorders associated with transport, e.g., 
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neurofibromatosis, postherpetic neuralgia, trigeminal neuropathy, sarcoidosis, sickle cell anemia, 
cataracts, infertility, pulmonary artery stenosis, sensorineural autosomal deafness, hyperglycemia, 
hypoglycemia, Grave's disease, goiter, glucose-galactose malabsorption syndrome, 
hypercholesterolemia, Gushing' s disease, and Addison's disease; and a connective tissue disorder 
5 such as osteogenesis imperfecta, Ehlers-Danlos syndrome, chondrodysplasias, Marfan syndrome, 
Alport syndrome, familial aortic aneurysm, achondroplasia, mucopolysaccharidoses, osteoporosis, , 
osteopetrosis, Paget' s disease, rickets, osteomalacia, hyperparathyroidism, renal osteodystrophy, 
osteonecrosis, osteomyelitis, osteoma, osteoid osteoma, osteoblastoma, osteosarcoma, 
osteochondroma, chondroma, chondroblastoma, chondromyxoid fibroma, chondrosarcoma, fibrous 

10 cortical defect, nonossifying fibroma, fibrous dysplasia, fibrosarcoma, malignant fibrous 

histiocytoma, Ewing's sarcoma, primitive neuroectodermal tumor, giant cell tumor, osteoarthritis, 
rheumatoid arthritis, ankylosing spondylarthritis, Reiter's syndrome, psoriatic arthritis, enteropathic 
arthritis, infectious arthritis, gout, gouty arthritis, calcium pyrophosphate crystal deposition disease, 
ganglion, synovial cyst, villonodular synovitis; systemic sclerosis, Dupuytren's contracture, hepatic . 

as fibrosis, lupus erythematosus, mixed connective tissue disease, epidermolysis bullosa simplex, bullous 
congenital ichthyosiform erythroderma (epidennolytic hyperkeratosis), nori-qridermolytic and 
epidermolytic palmpplantar keratoderma, ichthyosis bullosa of Siemens, pachyonychia congenita, and 
white sponge nevus. The dithp can be used to detect the presence of, or to quantify the amount of, a 
dithp-rdated polynucleotide in a sample. This information is then compared to information obtained 

20 from appropriate reference samples, and a diagnosis is established. Alternatively, a polynucleotide 
complementary to a given dithp can inhibit or inactivate a therapeutically relevant gene related to the 
dithp. 

Analysis of dithp Expression Patterns 

2 5 The expression of dithp may be routinely assessed by hybridization-based methods to 

determine, for example, the tissue-specificity, disease-specificity, or developmental stage-specificity of 
dithp expression. For example, the level of expression of dithp may be compared among different cell 
types or tissues, among diseased and normal cell types or tissues, among cell types or tissues at 
different developmental stages, or among cell types or tissues undergoing various treatments. This type 

30 of analysis is useful, for example, to assess the relative levels of dithp expression in fully or partially 
differentiated cells or tissues, to determine if changes in dithp expression levels are correlated with the 
development or progression of specific disease states, and to assess the response of a cell or tissue to a 
specific therapy, for example, in pharmacological or toxicological studies. Methods for the analysis of 
dithp expression are based on hybridization and amplification technologies and include membrane- 
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based procedures such as northern blot analysis, high-throughput procedures that utilize, for example, 
microarrays, and PCR-based procedures. ' 

Hybridization and Genetic Analysis 
5 The dithp, their fragments, or complementary sequences, may be used to identify the presence 

of and/or to determine the degree of similarity between two (or more) nucleic acid sequences. The dithp 
may be hybridized to naturally occurring or recombinant nucleic acid sequences under appropriately 
selected temperatures and salt concentrations. Hybridization with a probe based on the nucleic acid 
sequence of at least one of the dithp allows for the detection of nucleic acid sequences, including 

1 o genomic sequences, which are identical or related to the dithp of the Sequence Listing. Probes may be 

selected from non-conserved or unique regions of at least one of the polynucleotides of SEQ ID NO:l- 
21 1 and tested for their ability to identify or amplify the target nucleic acid sequence using standard 
protocols. 

•i. 

Polynucleotide sequences that are capable of hybridizing, in particular, to those shown in SEQ 
15 ID NO:l-21 1 and fragments thereof, can be identified using various conditions of stringency. (See, 
e.g„ Wahl, G.M. and S.L. Berger (1987) Methods Enzymol. 152:399-407; Kimmel, AR. (1987) 
Methods Enzymol. 152:507-511.) Hybridization conditions are discussed in "Definitions," 

A probe for use in Southern or northern hybridization may be derived from a fragment of a 
dithp sequence, or its complement, that is up to several hundred nucleotides in length and is either 

2 o single-stranded or double-stranded. Such probes may be hybridized in solution to biological materials 

such as plasmids, bacterial, yeast, or human artificial chromosomes, cleared or sectioned tissues, or to 
artificial substrates containing dithp. Microarrays are particularly suitable for identifying the presence 
of and detecting the level of expression for multiple genes of interest by examining gene expression 
correlated with, e.g., various stages of development, treatment with a drug or compound, or disease 

2 5 progression. An array analogous to a dot or slot blot may be used to arrange and link polynucleotides 
to the surface of a substrate using one or more of the following: mechanical <vacuum), chemical, 
thermal, or UV bonding procedures. Such an array may contain any number of dithp and may be 
produced by hand or by using available devices, materials, and machines. 

Microarrays may be prepared, used, and analyzed using methods known in the art. (See, e.g., 

30 Brennan, T.M. et al. (1995) U.S. Patent No. 5,474,796; Schena, M. et aL (1996) Proc. Natl. Acad. ScL 
USA 93:10614-10619; Baldeschweiler et al. (1995) PCT application W095/251 1 16; Shalon, D. et al. 
(1995) PCT application WO95/35505; Heller, R.A et al. (1997) Proc. Natl. Acad. ScL USA 94:2150- 
2155; and Heller, M.J. et al. (1997) U.S. Patent No. 5,605,662.) 

Probes may be labeled by either PCR or enzymatic techniques using a variety of -commercially 
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available reporter molecules. For example, commercial kits are available for radioactive and 
chemiluminescent labeling (Amersham Pharmacia Biotech) and for alkaline phosphatase labeling (Life 
Technologies). Alternatively, dithp may be cloned into commercially available vectors for the 
production of RNA probes. Such probes may be transcribed in the presence of at least one labeled 
5 nucleotide (e.g„ 32 P-ATP, Amersham Pharmacia Biotech). 

Additionally the polynucleotides of SEQ ID NO:i-21 1 or suitable fragments thereof can be 
used to isolate full length cDNA sequences utilizing hybridization and/or amplification procedures wdl 
known in the art, e.g., cDNA library screening, PCR amplification, etc. The molecular cloning of such 
full length cDNA sequences may employ the method of cDNA library screening with probes using the 
10 hybridization, stringency, washing, and probing strategies described above and in Ausubd, supra . . 
Chapters 3, 5, and 6. These procedures may also be employed with genomic libraries to isolate 
genomic sequences of ( dithp in order to analyze, e.g., regulatory elements. 

1 i 

Genetic Mapp in g 

15 Gene identification and mapping are important in the investigation and treatment of almost all 

conditions, diseases, and disorders. Cancer, cardiovascular disease, Alzheimer's disease, arthritis, 
diabetes, and mental illnesses are of particular interest Each of these conditions is more complex than 
the single gene defects of sickle cell anemia or cystic fibrosis, with select groups of genes being 
predictive of predisposition for a particular condition, disease, or disorder. For example, 

2 o cardiovascular disease may result from malfunctioning receptor molecules that fail to clear cholesterol 
from the bloodstream, and diabetes may result when a particular individual's immune system is 
activated by an infection and attacks the insulin-producing cells of the pancreas. In some studies, 
Alzheimer's disease has been linked to a gene on chromosome 21 ; other studies predict a different gene 
and location. Mapping of disease genes is a complex and reiterative process and generally proceeds 

2 5 from genetic linkage analysis to physical mapping. 

As a condition is noted among members of a family, a genetic linkage map traces parts of 
chromosomes that are inherited in the same pattern as the condition. Statistics link the inheritance of 
particular conditions to particular regions of chromosomes, as defined by RFLP or other markers. 
(See, for example, Lander, E. S. and Botstein, D. (1986) Proc. Natl. Acad. ScL USA 83:7353-7357.) 

3 o Occasionally, genetic markers and their locations are known from previous studies. More often, 

however, the markers are simply stretches of DNA that differ among individuals. Examples of genetic 
linkage maps can be found in various scientific journals or at the Online Mendelian Inheritance in Man 
(OMIM) World Wide Web site. 

In another embodiment of the invention, dithp sequences may be used to generate hybridization 
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probes useful in chromosomal mapping of naturally occurring genomic sequences. Either coding or 
noncoding sequences of dithp may be used, and in some instances, noncoding sequences may be 
preferable over coding sequences.. For example, conservation of a dithp coding sequence among 
members of a multtgene family may potentially cause undesired cross hybridization during 
5 chromosomal mapping. The sequences may be mapped to a particular chromosome, to a specific region 
of a chromosome, or to artificial chromosome constructions, e.g., human artificial chromosomes 
(HACs), yeast artificial chromosomes (YACs), bacterial artificial chromosomes (BACs), bacterial PI 
constructions, or single chromosome cDNA libraries. (See, e.g., Harrington, J. J. et al. (1997) Nat 
Genet. 15:345-355; Price, CM. (1993) Blood Rev. 7:127-134; and Trask, B.J. (1991) Trends Genet 
10 7:149-154.) 

Fluorescent in situ hybridization (FISH) may be correlated with other physical chromosome 
mapping techniques and genetic map data. (See, e.g., Meyers, supra , pp. 965-968.) Correlation 
between the location of dithp on a physical chromosomal map and a specific disorder, or a 
predisposition to a specific disorder, may help define the region of DNA associated with that disorder. 

15 The dithp sequences may also be used to detect polymorphisms that are genetically linked to the 
inheritance of a particular condition, disease, or disorder. 

In situ hybridization of chromosomal preparations and genetic mapping techniques, such as 
linkage analysis using established chromosomal markers, may be used for extending existing genetic 
maps. Often the placement of a gene on the chromosome of another mammalian species, such as 

2 o mouse, may reveal associated markers even if the number or arm of the corresponding human 

chromosome is not known. These new marker sequences can be mapped to human chromosomes and 
may provide valuable information to investigators searching for disease genes using positional cloning 
or other gene discovery techniques. Once a disease or syndrome has been crudely correlated by genetic 
linkage with a particular genomic region, e.g., ataxia-telangiectasia to 1 lq22-23, any sequences 

25 mapping to that area may represent associated or regulatory genes for further investigatioa (See, e.g., 
Gatti, R.A. et al. (1988) Nature 336:577-580.) The nucleotide sequences of the subject invention may 
also be used to detect differences in chromosomal architecture due to translocation, inversion, etc., 
among normal, carrier, or affected individuals. 

Once a disease-associated gene is mapped to a chromosomal region, the gene must be cloned in 

30 order to identify mutations or other alterations (e,g., translocations or inversions) that may be correlated 
with disease. This process requires a physical map of the chromosomal region containing the disease- 
gene of interest along with associated markers. A physical map is necessary for determining the 
nucleotide sequence of and order of marker genes on a particular chromosomal region. Physical 
mapping techniques are well known in the art and require the generation of overlapping sets of cloned 
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DNA fragments from a particular organelle, chromosome, or genome. These clones are analyzed to 
reconstruct and catalog their order. Once the position of a marker is determined, the DNA from that 
region is obtained by consulting the catalog and selecting clones from that regioa The gene of interest 
is located through positional cloning techniques using hybridization or similar methods. 

5 

Diagnostic Uses 

The dithp of the present invention may be used to design probes useful in diagnostic assays. 
Such assays, well known to those skilled in the art, may be used to detect or confirm conditions, 
disorders, or diseases associated with abnormal levels of dithp expression. Labeled probes developed 
l o from dithp sequences are added to a sample under hybridizing conditions of desired stringency. In some 
instances, dithp, or fragments or oligonucleotides derived from dithp, may be vised as primers in 
amplification steps prior to hybridization. Hie amount of hybridization complex formed is quantified 
and compared with standards for that cell or tissue. If dithp expression varies significantly from the 
standard, the assay indicates the presence of the condition, disorder, or disease. Qualitative or 
il5 quantitative diagnostic methods may include northern, dot blot, or other membrane or dip-stick based 
technologies or multiple-sample format technologies such as PCR, enzyme-linked immunosorbent assay 
(ELISA)-like, pin, or chip-based assays. 

The probes described above may also be used to monitor the progress of conditions, disorders, 
or diseases associated with abnormal levels of dithp expression, or to evaluate the efficacy of a 
, 20 particular therapeutic treatment The candidate probe may be identified from the dithp that are specific 
to a given human tissue and have not been observed in GenBank or other genome databases. Such a 
. probe may be used in animal studies, preclinical tests, clinical trials, or in monitoring the treatment of 
an individual patient In a typical process, standard expression is established by methods well known in 
the art for use as a basis of comparison, samples from patients affected by the disorder or disease are 
2 5 combined with the probe to evaluate any deviation from the standard profile, and a therapeutic agent is 
administered and effects are monitored to generate a treatment profile. Efficacy is evaluated by 
determining whether the expression progresses toward or returns to the standard normal pattern. 
Treatment profiles may be generated ova* a period of several days or several months. Statistical 
methods well known to those skilled in the art may be use to determine the significance of such 
30 therapeutic agents. 

The polynucleotides are also useful for identifying individuals from minute biological samples, 
for example, by matching the RFLP pattern of a sample's DNA to that of an individual's DNA. The 
polynucleotides of the present invention can also be used to determine the actual base-by-base DNA 
sequence of selected portions of an individual's genome. These sequences can be used to prepare PCR 
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primers for amplifying and isolating such selected DNA, which can then be sequenced Using this 
technique, an individual can be identified through a unique set of DNA sequences. Once a unique ID 
database is established for an individual, positive identification of that individual can be made from 
extremely small tissue samples. 
5 In a particular aspect, oligonucleotide primers derived from the dithp of the invention may be 

used to detect single nucleotide polymorphisms (SNPs). SNPs are substitutions, insertions and 
deletions that are a frequent cause of inherited or acquired genetic disease in humans. Methods of SNP 
detection include, but are not limited to, single-stranded conformation polymorphism (SSCP) and 
fluorescent SSCP (fSSCP) methods. In SSCP, oligonucleotide primers derived from dithp are used to 

l o amplify DNA using the polymerase chain reaction (PCR). The DNA may be derived, for example, 
from diseased or normal tissue, biopsy samples, bodily fluids, and the like. SNPs in the DNA cause 
differences in the secondary and tertiary structures of PCR products in single-stranded form, and these 
differences are detectable using gel electrophoresis in non-denaturing gels. In fSCCP, the 
oligonucleotide primers are fluorescently labeled, which allows detection of the amplimers in high- - 

15 throughput equipment such as DNA sequencing machines. Additionally, sequence database analysis 
methods, termed in silico SNP (isSNP), are capable of identifying polymorphisms by comparing the 
sequences of individual overlapping DNA fragments which assemble into a common consensus 
sequence. These computer-based methods filter out sequence variations due to laboratory preparation 
of DNA and sequencing errors using statistical models and automated analyses of DNA sequence 

20 chromatograms. In the alternative, SNPs may be detected and characterized by mass spectrometry 
using, for example, the high throughput MASSARRAY system (Sequenom, Inc., San Diego CA). 

DNA-based identification techniques are critical in forensic technology. DNA sequences taken 
from very small biological samples such as tissues, e.g., hair or skin, or body fluids, e.g., blood, saliva, 
semen, etc., can be amplified using, e.g„ PCR, to identify individuals. <See, e.g., Erlich, H. (1992) 

25 PCR Technology , Freeman and Co., New York, NY). Similarly, polynucleotides of the present 
invention can be used as polymorphic markers. 

There is also a need for reagents capable of identifying the source of a particular tissue. 
Appropriate reagents can comprise, for example, DNA probes or primers prepared from the sequences 
of the present invention that are specific for particular tissues. Panels of such reagents can identify 

3 o tissue by species and/or by organ type. In a similar fashion, these reagents can be used to screen tissue 
cultures for contamination. 

The polynucleotides of the present invention can also be used as molecular weight markers on 
nucleic acid gels or Southern blots, as diagnostic probes for the presence of a specific mRNA in a 
particular ceil type, in the creation of subtracted cDN A libraries which aid in the discovery of novel 
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polynucleotides, in selection and synthesis of oligomers for attachment to an array or other support, and 

! 

as an antigen to elicit an immune response. 

i 

Disease Model Systems Using ditto 
5 The dithp of the invention or their mammalian homoiogs may be "knocked out" in an animal 

model system using homologous recombination in embryonic stem (ES) cells. Such techniques are well 
known in the art and are useful for the generation of animal models of human disease, (See, e.g., U.S. 
Patent Number 5,175,383 and U.S. Patent Number 5,767,337.) For exan^le* mouse ES ceils, such as 
the mouse 129/SvJ cell line, are derived from the early mouse embryo and grown in culture. Hie ES 

10 cells are transformed with a vector containing the gene of interest disrupted by a marker gene* e.g„ the 
neomycin phosphotransferase gene (neo; Capecchi, M.R. (1989) Science 244:1288-1292). The vector 
integrates into the corresponding region of the host genome by homologous recombination. 
Alternatively, homologous recombination takes place using the Cre-loxP system to knockout a gene of 
interest in a tissue- or developmental stage-specific manna* (Marth, J.D. (1996) Clin. Invest 97:1999- 

,15 2002; Wagner, K.U. et al. (1997) Nucleic Acids Res. 25:4323-4330). Transformed ES cells are 

identified and microinjected into mouse cell blastocysts such as those from the C57BL/6 mouse strain. 
The blastocysts are surgically transferred to pseudopregnant dams, and the resulting chimeric progeny 
are genotyped and bred to produce heterozygous or homozygous strains. Transgenic animals thus 
generated may be tested with potential therapeutic or toxic agents. 

20 The dithp of the invention may also be manipulated in vitro in ES cells derived from human 

blastocysts. Human ES cells have the potential to differentiate into at least eight separate cell lineages 
including endoderm, mesoderm, and ectodermal cell types. These cell lineages differentiate into, for 
example, neural cells, hematopoietic lineages, and cardiomyocytes (Thomson, J.A. et al. (1998) Science 
282:1145-1147). 

2 5 The dithp of the invention can also be used to create "knockin" humanized animals (pigs) or 

transgenic animals (mice or rats) to model human disease. With knockin technology, a region of dithp 
is injected into animal ES cells, and the injected sequence integrates into the animal cell genome. 
Transformed cells are injected into blastulae, and the blastulae are implanted as described above. 
Transgenic progeny or inbred lines are studied and treated with potential pharmaceutical agents to 

3 o obtain information on treatment of a human disease. Alternatively, a mammal inbred to overexpress 

dithp, resulting, e.g., in the secretion of DITHP in its milk, may also serve as a convenient source of 
that protein (Janne, J. et al. (1998) Biotechnol. Annu. Rev. 4:55-74). 

Screening Assays 
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DITHP encoded by polynucleotides of the present invention may be used' to screen for 
molecules that bind to or are bound by the encoded polypeptides. The binding of the polypeptide and 
the molecule may activate (agonist), increase, inhibit (antagonist), or decrease activity of the 
polypeptide or the bound molecule. Examples of such molecules include antibodies, oligonucleotides, 
5 proteins (e.g., receptors), or small molecules. 

Preferably, the molecule is closely related to the natural ligand of the polypeptide, e.g., a ligand 
or fragment thereof, a natural substrate, or a structural or functional mimetic. (See, Coligan et al., 
(1991) Current Protocols in Immunology 1(2): Chapter 5.) Similarly, the molecule can be closely 
related to the natural receptor to which the polypeptide binds, or to at least a fragment of the receptor, 

1 o e.g., , the active site. In either case, the molecule can be rationally designed using known techniques. 

Preferably, the screening for these molecules involves producing appropriate cells which express the 
polypeptide, either as a secreted protein or on the cell membrane. Preferred cells include cells from 
mammals, yeast, Drosophila . or E. coli . Cells expressing the polypeptide or cell membrane fractions 
which contain the expressed polypeptide are then contacted with a test compound and binding, 
15 stimulation, or inhibition of activity of either the polypeptide or the molecule is analyzed. 

An assay may simply test binding of a candidate compound to the polypeptide, wherein binding 
is detected by a fluorophore, radioisotope, enzyme conjugate, or other detectable label. Alternatively, 
the assay may assess binding in the presence of a labeled competitor. 

Additionally, the assay can be carried out using cell-free preparations, polypeptide/molecule 

2 o affixed to a solid support, chemical libraries, or natural product mixtures. The assay may also, simply 

comprise the steps of mixing a candidate compound with a solution containing a polypeptide, measuring 
polypeptide/molecule activity or binding, and comparing the polypeptide/molecule activity or binding to 
a standard. 

Preferably, an ELISA assay using, e>g., a monoclonal or polyclonal antibody, can measure 

2 5 polypeptide level in a sample. The antibody can measure polypeptide level by either binding, directly or 

indirectly, to the polypeptide or by competing with the polypeptide for a substrate* 

All of the above assays can be used in a diagnostic or prognostic context. The molecules 
discovered using these assays can be used to treat disease or to bring about a particular result in a 
patient (e.g., blood vessel growth) by activating or inhibiting the polypeptide/molecule. Moreover, the 

3 o assays can discover agents which may inhibit or enhance the production of the polypeptide from 

suitably manipulated cells or tissues. 



Transcript Imaging and Toxicological Testing 

Another embodiment relates to the use of dithp to develop a transcript image of a tissue or cell 

145 



BNSOOCID: <WO 01 62927A2J_> 



WO 01/62927 



PCT/US01/06059 



type. A transcript image represents the global pattern of gene expression by a particular tissue or cell 
type. Global gene expression patterns are analyzed by quantifying the number of expressed genes and 
their relative abundance under given conditions and at a given time. (See Seflhamer et al., 
"Comparative Gene Transcript Analysis," U.S. Patent Number 5,840,484, expressly incorporated by 
5 reference herein.) Thus a transcript image may be generated by hybridizing the polynucleotides of the 
present invention or their complements to the totality of transcripts or reverse transcripts of a particular 
tissue or cell type. In one embodiment, the hybridization takes place, in high-throughput format, 
wherein the polynucleotides of the present invention or their complements comprise a subset of a 
plurality of elements on a microarray. The resultant transcript image would provide a profile of gene 
10 activity pertaining to human molecules for diagnostics and therapeutics. 1 

Transcript images which profile dithp expression may be generated using transcripts isolated 

from tissues, cell lines, biopsies, or other biological samples. The transcript image may thus reflect 

i 

dithp expression in vivo , as in the case of a tissue or biopsy sample, or in vitro , as in the case of a cell 
line. 

1 5 Transcript images which profile dithp expression may also be used in conjunction with in vitro 

model systems and preclinical evaluation of pharmaceuticals, as well as toxicological testing of 
industrial and naturally-occurring environmental compounds. All compounds induce characteristic 
gene expression pattens, frequently termed molecular fingerprints or toxicant signatures, which are 
indicative of mechanisms of action and toxicity (Nuwaysir, E. F. et al. (1999) Mol. Carcinbg. 24:153- 

20 159; Steiner, S. and Anderson, N.L. (2000) Toxicol. Lett. 112-113:467-71, expressly incorporated by 
reference herein). If a test compound has a signature similar to that of a compound with known 
toxicity, it is likely to share those toxic properties. These fingerprints or signatures are most useful and 
refined when they contain expression information from a large number of genes and gene families. 
Ideally, a genome- wide measurement of expression provides the highest quality signature. Even genes 

2 5 whose expression is not altered by any tested compounds are important as well, as the levels of 

expression of these genes are used to normalize the rest of the expression data. The normalization 
procedure is useful for comparison of expression data after treatment with different compounds. While 
the assignment of gene function to elements of a toxicant signature aids in interpretation of toxicity 
mechanisms, knowledge of gene function is not necessary for the statistical matching of signatures 

3 o which leads to prediction of toxicity. (See, for example, Press Release 00-02 from the National 

Institute of Environmental Health Sciences, released February 29, 2000, available at 
httpy/www.indis.nih.gov/oc/news/toxchip.htm.) Therefore, it is important and desirable in 
toxicological screening using toxicant signatures to include all expressed gene sequences. 

In one embodiment, the toxicity of a test compound is assessed by treating a biological sample 
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containing nucleic acids, with the test compound. Nucleic acids that are expressed in the treated 
biological sample are hybridized with, one or more probes specific to the polynucleotides of the 
present invention, sq that transcript levels corresponding to the polynucleotides of the present 
invention may be quantified. The transcript levels in the treated biological sample are compared with 

5 levels in an untreated biological sample. Differences in the transcript levels between the two samples 

i 1 

are indicative of a toxic response caused by the test compound in the treated sample. 

Another particular embodiment relates to the use of DITHP encoded by polynucleotides of the 
present invention to analyze the proteome of a tissue or cell type. The term proteome refers to the 
global pattern of protein expression in a particular tissue or cell type. Each protein component of a 

l o proteome can be subjected individually to further analysis. Proteome expression patterns, or profiles, 
are analyzed by quantifying the number of expressed proteins and their relative abundance under given 
conditions and at a given time. A profile of a cell* s proteome may thus be generated by separating and 
analyzing the polypeptides of a particular tissue or cell type. In one embodiment, the separation is 
achieved using two-dimensional gel electrophoresis, in which proteins from a sample are separated by 

15 isoelectric focusing in the first dimension, and then according to molecular weight by sodium dodecyi 
sulfate slab gel electrophoresis in the second dimension (Steiner and Anderson, supra) . The proteins are 
visualized in the gel as discrete and uniquely positioned spots, typically by staining the gd with an 
agent such as Coomassie Blue or silver or fluorescent stains. The optical density of each protein spot is 
generally proportional to the level of the protein in the sample. The optical densities of equivalently 

20 positioned protein spots from different samples, for example, from biological samples either treated or 
untreated with a test compound or therapeutic agent, are compared to identify any changes in protein 
spot density related to the treatment The proteins in the spots are partially sequenced using, for 
example, standard methods employing chemical or enzymatic cleavage followed by mass spectrometry. 
The identity of the protein in a spot may be determined by comparing its partial sequence, preferably of 
.25 at least 5 contiguous amino acid residues, to the polypeptide sequences of the present invention. In 
some cases, further sequence data may be obtained for definitive protein identification. 

A proteomic profile may also be generated using antibodies specific for DITHP to quantify the 
levels of DITHP expressioa In one embodiment, the antibodies are used as elements on a microarray, 
and protein expression levels are quantified by exposing the microarray to the sample and detecting the 

3 o levels of protein bound to each array element (Lueking, A. et al. (1999) Anal. Biochem. 270:103-1 1 ; 
Mendoze, L.G. et al. (1999) Biotechniques 27:778-88). Detection may be performed by a variety of 
methods known in the art, for example, by reacting the proteins in the sample with a thiol- or amino- 
reactive fluorescent compound and detecting the amount of fluorescence bound at each array element 

Toxicant signatures at the proteome level are also useful for toxicological screening, and should 
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be analyzed in parallel with toxicant signatures at the transcript level. There is a poor correlation 
between transcript and protein abundances for some proteins in some tissues (Anderson, N.L. and 
Seflhamer, J. (1997) Electrophoresis 18:533-537), so proteome toxicant signatures may be useful in the 
analysis of compounds which do not significantly, affect the transcript image> but which alter the 
5 proteomic profile. In addition, the analysis of transcripts in body fluids is difficult, due to rapid 
degradation of mRNA, so proteomic profiling may be more reliable and informative in such cases. 
, In another embodiment, the toxicity of a test compound is assessed by treating a biological 

sample containing proteins with the test compound. Proteins that are expressed in the treated biological 
sample are separated so that the amount of each protein can be quantified. The amount of each protein 
10 is compared to the amount of the corresponding protein in an untreated biological sample. A difference 
in the amount of protein between the two samples is indicative of a toxic response to the test compound 
in the treated sample. Individual proteins are identified by sequencing the amino acid residues of the 
individual proteins and comparing these partial sequences to the DITHP encoded by polynucleotides of 
the present invention. 

15 In another embodiment, the toxicity of a test compound is assessed by treating a biological 

sample containing proteins with the test compound. Proteins from the biological sample are incubated 
with antibodies specific to the DITHP encoded by polynucleotides of the present invention. The amount 
of protein recognized by the antibodies is quantified. The amount of protein in the treated biological 
sample is compared with the amount in an untreated biological sample. A difference in the amount of 

20 protein between the two samples is indicative of a toxic response to the tester 
sample. 

Transcript images may be used to profile dithp expression in distinct tissue types. This process 
can be used to determine human molecule activity in a particular tissue type relative to this activity in a 
different tissue type. Transcript images may be used to generate a profile of dithp expression 

2 5 characteristic of diseased tissue. Transcript images of tissues before and after treatment may be used 

for diagnostic purposes, to monitor the progression of disease, and to monitor the efficacy of drug 
treatments for diseases which affect the activity of human molecules. 

Transcript images of ceil lines can be used to assess human molecule activity and/or to identify 
cell lines that lack or misregulate this activity. Such cell lines may then be treated with pharmaceutical 

3 o agents, and a transcript image following treatment may indicate the efficacy of these agents in restoring 

desired levels of this activity. A similar approach may be used to assess the toxicity of pharmaceutical 
agents as reflected by undesirable changes in human molecule activity. Candidate pharmaceutical 
agents may be evaluated by comparing their associated transcript images with those of pharmaceutical 
agents of known effectiveness. 
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Antisense Molecules 

The polynucleotides of the present invention are useful in antisense technology. Antisense 
technology or therapy relies on the modulation of expression of a target protein through the specific 
binding of an antisense sequence to a target sequence encoding the target protein or directing its 
5 expression. (See, e.g., Agrawal, S., ed (1996) Antisense Therapeutics, Humana Press Inc., Totawa 
NJ; Alama, A. et al. (1997) Pharmacol. Res. 36(3):171-178; Crooke, S.T. (1997) Adv. Pharmacol. 
40:1-49; Sharma, H.W. andR. Narayanan (1995) Bioessays 17(12):1055-1063; andLavrosky, Y. et 
al. (1997) Biochem. Mol. Med. 62(l):ll-22.) An antisense sequence is a polynucleotide sequence 
capable of specifically hybridizing to at least a portion of the target sequence. Antisense sequences 

10 bind to cellular mRNA and/or genomic DNA, affecting translation and/or transcription. Antisense 
sequences can be DNA, RNA, or nucleic acid mimics and analogs. (See, e.g., Rossi, J.J. et al. (1991) 
Antisense Res. Dev. l(3):285-288; Lee, R. et al. (1998) Biochemistry 37(3):900-1010; Pardridge, 
W.M. et al. (1995) Proc. Natl. Acad. Sci. USA 92(12):5592-5596; and Nielsen, P. E. and Haaima, G. 

, (1997) Chem. Soc. Rev. 96:73-78.) Typically, the binding which results in modulation of expression 

1 5 occurs through hybridization or binding of complementary base pairs. Antisense sequences can also 
bind to DNA duplexes through specific interactions in the major groove of the double helix. 

The polynucleotides of the present invention and fragments thereof can be used as antisense 
sequences to modify the expression of the polypeptide encoded by dithp. The antisense sequences can 
be produced ex vivo , such as by using any of the ABI nucleic acid synthesizer series (Applied 

2 o Biosystems) or other automated systems known in the art Antisense sequences can also be produced 
biologically, such as by transforming an appropriate host cell with an expression vector containing the 
sequence of interest. (See, e.g„ Agrawal, supra .) 

In therapeutic use, any gene delivery system suitable for introduction of the antisense sequences 
into appropriate target cells can be used. Antisense sequences can be delivered intracellularly in the 

2 5 form of an expression plasmid which, upon transcription, produces a sequence complementary to at 

least a portion of the cellular sequence encoding the target proteia (See, e.g., Slater, J.E., et al. (1998) 
J. Allergy Clin. Immunol. 102(3):469-475; and Scanlon, KJ., et al. (1995) 9(13):1288-1296.) 
Antisense sequences can also be introduced intracellularly through the use of viral vectors, such as 
retrovirus and adeno- associated virus vectors. (See, e.g., Miller, A.D. (1990) Blood 76:271; Ausubel, 

3 o F.M. et al. (1995) Current Protocols in Molecular Biology. John Wiley & Sons, New York NY; Ucfcert, 

W. and W. Walther (1994) Pharmacol. Ther. 63(3):323-347.) Other gene delivery mechanisms include 
liposome-derived systems, artificial viral envelopes, and other systems known in the art. (See, eg., 
Rossi, J.J. (1995) Br. Med. Bull. 51(l):217-225; Boado, R.J. et al. (1998) J. Pharm. ScL 87(11):1308- 
1315; and Morris, M.C. et al. (1997) Nucleic Acids Res. 25(14):2730-2736.) 
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Expression 

In order to express a biologically active DITHP, the nucleotide sequences encoding DITHP or 
fragments thereof may be inserted into an appropriate expression vector, i.e., a vector which contains 
the necessary elements for transcriptional and translational control of the inserted coding sequence in a 
5 suitable host Methods which are well known to those skilled in the art may be used to construct 
expression vectors containing sequences encoding DITHP and appropriate transcriptional and 
translational control elements. These methods include in vitro recombinant DNA techniques, synthetic 
techniques, and in vivo genetic recombinatioa (See, e.g:, Sambrook, supra . Chapters 4, 8, 16, and 17; 
and Ausubel, supra . Chapters 9, 10, 13, and 16.) ' , 

10 A variety of expression vector/host systems may be utilized to contain and express sequences 

encoding DITHP. These include, but are not limited to, microorganisms such as bacteria transformed 
with recombinant bacteriophage, plasmid, or cosmid DNA expression vectors; yeast transformed with 
yeast expression vectors; insect cell systems infected with viral expression vectors (e.gj, baculovirus); 
plant cell systems transformed with viral expression vectors (e.g., cauliflower mosaic virus, CaMV, or 

15 tobacco mosaic virus, TMV) or with bacterial expression vectors (e.g., H or pBR322 plasmids); or 
animal (mammalian) cell systems. (See> e.g., Sambrook, supra ; Ausubel, 1995, supra. Van Heeke, G. 
and S.M. Schuster (1989) J. Biol. Chem. 264:5503-5509; Bitter, G.A. et al. (1987) Methods Enzymol. 
153:516-544; Scorer, C.A. et al. (1994) Bio/Technology 12:181-184; Engelhard, E.K. et al. (1994) 
Proc. Nafl. Acad. Sci. USA 91:3224^3227; Sandig, V. et al. (1996) Hum. Gene Ther; 7:1937-1945; 

20 Takamatsu, N. (1987) EMBO J. 6:307-311; Coruzzi, G. et al. (1984) EMBO J. 3:1671-1680; Broglie, 
R. et al. (1984) Science 224:838-843; Winter, J. et al. (1991) Results Probl. Cell Differ. 17:85-105; 
The McGraw Hill Yearbook of Science and Technology (1992) McGraw Hill, New York NY, pp. 
191-196; Logan, J. and T. Shenk (1984) Proc. Natl. Acad. Sci. USA 81:3655-3659; and Harrington, 
J J. et al. (1997) Nat Genet. 15:345-355.) Expression vectors derived from retroviruses, adenoviruses, 

25 or herpes or vaccinia viruses, or from various bacterial plasmids, may be used for delivery of nucleotide 
sequences to the targeted organ, tissue* or cell population. (See, eg., Di Nicola, M. et al. (1998) 
Cancer Gen. Ther. 5(6):350-356; Yu, M. et al., (1993) Proc. Nati. Acad. Sci. USA 90(13):6340-6344; 
Buller, R.M. et al. (1985) Nature 317(6040):813-815; McGregor, D.P. et al. (1994) Mol. Immunol. 
31(3):219-226; and Verma, I.M. and N. Somia (1997) Nature 389:239-242.) The invention is not 

3 o limited by the host cell employed. 

For long team production of recombinant proteins in mammalian systems, stable expression of 
DITHP in cell lines is preferred. For example, sequences encoding DITHP can be transformed into ceil 
lines using expression vectors which may contain viral origins of replication and/or endogenous 
expression elements and a selectable marker gene on the same or on a separate vector. Any number of 
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selection systems may be used to recover transformed cell lines. (See, e.g., Wigjer, M. et al. (1977) 
Cell 11:223-232; Lowy, I. et al. (1980) Cell' 22:817-823.; Wigler, M. et al. (1980) Proc. Natl. Acad. 
ScL USA 77:3567-3570; Colbere-Garapin, F. et al. (1981) J. Mol. Biol. 150:1-14; Hartman, S.C. and 
R.C.Mulligan (1988) Proc. NatL Acad. Sci. USA 85:8047-8051; Rhodes, C.A. (1995) Methods Mol. 
5 Biol. 55:121-131.) 

Therapeutic Uses of dithp 

The dithp of the invention may be used for somatic or germline gene therapy. Gene therapy 
may be performed to (i) correct a genetic deficiency (e.g., in the cases of severe combined 

10 immunodeficiency (SCH>)-X1 disease characterized by X-linked inheritance (Cavazzana-Calvo, M. et 
al. (2000) Science 288:669-672), severe combined immunodeficiency syndrome associated with an 
inherited adenosine deaminase (ADA) deficiency (Blaese, R.M. et al. (1995) Science 270:475-480; 
Bordignon, C. et al. (1995) Science 270:470-475), cystic fibrosis (Zabner, J. et al. (1993) Cell 75:207- 
216; Crystal, R.G. et al. (1995) Hum. Gene Therapy 6:643-666; Crystal, R.G. et al. (1995) Hum. Gene 

is Therapy 6:667-703), thalassemias, familial hypercholesterolemia, and hemophilia resulting from Factor 
Vm or Factor DC deficiencies (Crystal, R.G. (1995) Science 270:404-410; Venna, I.M. and Somia, N. 
(1997) Nature 389:239-242)), (ii) express a conditionally lethal gene product (e.g., in the case of 
cancers which result from unregulated cell proliferation), or (iii) express a protein which affords • 
protection against intracellular parasites (e.g., against human retroviruses, such as human 

20 immunodeficiency virus (HIV) (Baltimore, D. (1988) Nature 335:395-396; PoescMa, E. et al. (1996) 
Proc. Natt. Acad. Sci. USA. 93:11395-11399), hepatitis B or C virus (HBV, HCV); fungal parasites, 
such as Candida albicans and Paracoccidioides brasiliensis : and protozoan parasites such as 
Plasmodium falciparum and Trypanosoma cruzi) . In the case where a genetic deficiency in dithp 
expression or regulation causes disease, the expression of dithp from an appropriate population of 

25 transduced cells may alleviate the clinical manifestations caused by the genetic deficiency. 

In a further embodiment of the invention, diseases or disorders caused by deficiencies in dithp 
are treated by constructing mammalian expression vectors comprising dithp and introducing these 
vectors by mechanical means into dithp-deficient cells. Mechanical transfer technologies for use with 
cells in vivo or ex vitro include (i) direct DNA microinjection into individual cells, (ii) ballistic gold 

3 o particle delivery, (iii) liposome-mediated transfection, (iv) receptor-mediated gene transfer, and (v) the 
use of DNAtransposons (Morgan, R.A. and Anderson, W.F. (1993) Annu. Rev. Biochem. 62:191-217; 
Ivies, Z. (1997) Cell 91:501-510; Boulay, J-L. and R&ipon, H. (1998) Curr. Opin. Biotechnol. 9:445- 
450). 

Expression vectors that may be effective for the expression of dithp include, but are not limited 
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to, the PCDNA 3.1, EPITAG, PRCCMV2, PREP, PVAX vectors (Invitrogen, Carlsbad CA), 
PCMV-SCRIPT, PCMV-TAG, PEGSH/PERV (Stratagene, La Jolla CA), and PTET-OFF, 
PTET-ON, PTRE2, PTRE2-LUC, PTK-HYG (Clontech, Palo Alto CA); The dithp of the invention 
may be expressed using (i) a constitutively active promoter, (e.g., from cytomegalovirus (CMV), Rous 
5 sarcoma virus (RSV), SV40 virus, thymidine kinase (TK), or p-actin genes), (ii) an inducible promote 
(e.g., the tetracycline-regulated promote (Gossen, M. andBujai;d, H. (1992) Proc. NatL Acad. Sci. 
U.S.A. 895547-5551; Gossen, t/L et al., (1995) Science 268:1766-1769; Rossi, F.M.V. and Blau, 
H.M. (1998) Curr. Opia Biotechnol. 9:45 1-456), commercially available in the T-REX plasmid 
(Invitrogen); the ecdysone-inducible promote (available in theplasmids PVGRXR and PIND; 

10 Invitrogen); the FK506/rapamycin inducible promoter; or the RU486/mifepristone inducible promote 
(Rossi, F.M.V. and Blau, H.M. supra), or (iii) a tissue-specific promote or the native promote of the 
endogenous gene encoding DITHP from a normal individual. 

Commercially available liposome transformation kits, (e.g., the PERFECT LIPID 
TRANSFECTION KIT, available from Invitrogen) allow one with ordinary skill in the art to deliver 

15 polynucleotides to target cells in culture and require minimal effort to optimize experimental 
parameters. In the alternative, transformation is performed using the calcium phosphate method 
(Graham, FX. and Eb, AJ. (1973) Virology 52:456-467), or by electroporation (Neumann, E. et al. 
(1982) EMBO J. 1 :841-845). The introduction of DNA to primary cells requires modification of these 
standardized mammalian transfection protocols. 

20 In another embodiment of the invention, diseases or disorders caused by genetic defects with 

respect to dithp expression are treated by constructing a retrovirus vector consisting of (i) dithp under 
the control of an independent promote or the retrovirus long terminal repeat (LTR) promote, (ii) 
appropriate RNA packaging signals, and (iii) a Rev-responsive element (RRE) along with additional 
retrovirus cw-acting RNA sequences and coding sequences required for efficient vector propagatioa 

25 Retrovirus vectors (e.g., PFB and PFBNEO) are commercially available (Stratagene) and are based on 
published data (Riviere, I. et al. (1995) Proc. Natl. Acad. Sci. U.S. A 92:6733-6737), incorporated by 
reference herein. The vector is propagated in an appropriate vector producing cell line (VPCL)that 
expresses an envelope gene with a tropism for receptors on the target cells or a promiscuous envelope 
protein such as VSVg (Armentano, D. et al. (1987) J. Virol. 61:1647-1650; Bender, M.A et al. (1987) 

30 J. Virol. 61:1639-1646; Adam, M.A and Miller, AD. (1988) J. Virol. 62:3802-3806; Dull, T. et al. 
(1998) J. Virol. 72:8463-8471 ; Zufferey, R. et al (1998) J. Virol. 72:9873-9880). U.S. Patent Number 
5,910,434 to Rigg ("Method for obtaining retrovirus packaging cell lines producing high transducing 
efficiency retroviral supernatant'*) discloses a method for obtaining retrovirus packaging cell lines and 
is hereby incorporated by reference. Propagation of retrovirus vectors, transduction of a population of 
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cells (eg., CD4* T-cells), and the return of transduced cells to a patient are procedures well known to 
persons skilled in the art of gene therapy and have been well documented (Ranga, U. et al. (1997) J. 
Virol. 71:7020-7029; Bauer, G. et al. (1997) Blood 89:2259-2267; Bonyhadi, MX. (1997) J. Virol. 
71:4707-4716; Ranga, U. et al. (1998) Proc. Natl. Acad. Sci. U.S.A. 95:1201-1206; Su, L. (1997) 
5 Blood 89:2283-2290). 

In the alternative, an adenovirus-based gene therapy delivery system is used to deliver dithp to 

1 i ■ 

cells which have one or more genetic abnormalities with respect to the expression of dithp. The 
construction and packaging of adenovirus-based vectors are well known to those with ordinary skill in 
the art. Replication defective adenovirus vectors have proven to be versatile for importing genes 

10 encoding immunoregulatory proteins into intact islets in the pancreas (Csete, M.E. et al. (1995) 
Transplantation 27:263-268). Potentially useful adenoviral vectors are described in U.S. Patent 
Number 5,707,618 to Armentano ("Adenovirus vectors for gene therapy"), hereby incorporated by 
reference. For adenoviral vectors, see also Antinozzi, P.A. et al. (1999) Annu. Rev. Nutr. 19:511-544 
and Verma, I.M. and Somia, N. (1997) Nature 18:389:239-242, both incorporated by reference herein. 

15 In another alternative, a herpes-based, gene therapy delivery system is used to deliver dithp to 

target cells which have one or more genetic abnormalities with respect to the expression of dithp. The 
use of hopes simplex virus (HS V)-based vectors may be especially valuable for introducing dithp to 
cells of the central nervous system, for which HSV has a tropism. The construction and packaging of 
herpes-based vectors are well known to those with ordinary skill in the art A replication-competent 

20 herpes simplex virus (HSV) type 1 -based vector has been used to deliver a reporter gene to the eyes of 
primates (Liu, X. et al. (1999) Exp. Eye Res.l69:385-395). The construction of a HSV-1 virus vector 
has also been disclosed in detail in U.S. Patent Number 5,804,413 to DeLuca ("Herpes simplex virus 
strains for gene transfer"), which is hereby incorporated by reference. U.S. Patent Number 5,804,41 3 
teaches the use of recombinant HSV d92 which consists of a genome containing at least one exogenous 

2 5 gene to be transferred to a cell under the control of the appropriate promoter for purposes including 
human gene therapy. Also taught by this patent are the construction and use of recombinant HSV 
strains deleted for ICP4, ICP27 and ICP22. For HSV vectors, see also Goins, W. F. et al. 1999 J. 
Virol. 73:519-532 and Xu, H. et al., (1994) Dev. Biol. 163:152-161, hereby incorporated by reference. 
The manipulation of cloned herpesvirus sequences, the generation of recombinant virus following the 

30 transfection of multiple plasmids containing different segments of the large herpesvirus genomes, the 
growth and propagation of herpesvirus, and the infection of cells with hopes virus are techniques well 
known to those of ordinary skill in the art 

In another alternative, an alphavirus (positive, single-stranded RN A virus) vector is used to 
deliver dithp to target cells. The biology of the prototypic alphavirus, SemKM Forest Virus (SFV), has 
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been studied extensively and gene transfer vectors have been based on the SFV genome (Garoff, H. and 
Li, K-J. (1998) Curr. Opin. Biotech 9:464-469). During alphavirus RNA replication, a subgenomic 
RNA is generated that normally encodes the viral capsid proteins. This subgenomic RNA replicates to 
higher levels than the full-length genomic RNA, resulting in the overproduction of capsid protons 
5 relative to the viral proteins with enzymatic activity (e.g., protease and polymerase). Similarly, 

inserting dithp into the alphavirus genome in place of the capsid-coding region results in the production 
' of a large number of dithp RNAs and the synthesis of high levels of DITHP in vector transduced cells. 

While alphavirus infection is typically associated with cell lysis within a few days, the ability to 
establish a persistent infection in hamster normal kidney cells (BHK-21) with a variant of Sindbis virus 

l o (SIN) indicates that the lytic replication of alphaviruses can be altered to suit the needs of the gene 
therapy application (Dryga, S.A. et al. (1997) Virology 228:74-83). The wide host range of 
alphaviruses will allow the introduction of dithp into a variety of cell types. The specific transduction 
of a subset of cells in a population may require the sorting of cells prior to transduction. The methods 
of manipulating infectious cDNA clones of alphaviruses, performing alphavirus cDNA and RNA 

15 transfections, and performing alphavirus infections, are well known to those with ordinary skill in the 
art. 

, Antibodies 

Anti-DITHP antibodies may be used to analyze protein expression levels. Such antibodies 
20 include, but are not limited to, polyclonal, monoclonal, chimeric, single chain, and Fab fragments. For 
descriptions of and protocols of antibody technologies, see, eg., Pound J.D. (1998) Immunochemical 
Protocols . Humana Press, Totowa, NJ. 

The amino acid sequence encoded by the dithp of the Sequence Listing may be analyzed by 
appropriate software (e.g., LASERGENE NAVIGATOR software, DNASTAR) to determine regions 
25 of high immunogenicity. The optimal sequences for immunization are selected from the C-terminus, the 
N-terminus, and those intervening, hydrophilic regions of the polypeptide which are likely to be exposed 
to the external environment when the polypeptide is in its natural conformation. Analysis used to select 
appropriate epitopes is also described by Ausubel (1997, supra , Chapter 11.7). Peptides used for 
antibody induction do not need to have biological activity; however, they must be antigenic. Peptides 
30 used to induce specific antibodies may have an amino acid sequence consisting of at least five amino 
acids, preferably at least 10 amino acids, and most preferably at least 15 amino acids. A peptide which 
mimics an antigenic fragment of the natural polypeptide may be fused with another protein such as 
keyhole limpet hemocyanin (KLH; Sigma, St Louis MO) for antibody production. A peptide 
encompassing an antigenic region may be expressed from a dithp, synthesized as described above, or 
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purified from human ceils. 

Procedures well known in the art may be used for the production of antibodies. Various hosts 
including mice* goats, and rabbits, may be immunized by injection with a peptide. Depending on the 
host species, various adjuvants may be used to increase immunological response. 
5 In one procedure, peptides about 15 residues in length may be synthesized using an ABI 431 A 

peptide synthesizer (Applied Biosystems) using fmoc-chemistry and coupled to KLH (Sigma) by 
reaction with M-maleimidobenzoyl-N-hydroxysuccinimide ester (Ausubel, 1995, supra) . Rabbits are 
immunized with the peptide-KLH complex in complete Freund's adjuvant The resulting antisera are 
tested for antipeptide activity by binding the peptide to plastic, blocking with 1% bovine serum albumin 

10 (BSA), reacting with rabbit antisera, washing, and reacting with radioiodinated goat anti-rabbit IgG. 
Antisera with antipeptide activity aire tested for anti-DITHP activity using protocols well known in the 
art, including ELISA, radioimmunoassay (RIA), and immunoblotting. 

In another procedure, isolated and purified peptide may be used to immunize mice (about 100 
|ig of peptide) or rabbits (about 1 mg of peptide). Subsequently, the peptide is radioiodinated and used 

15 to screen the immunized animals' B -lymphocytes for production of antipeptide antibodies. Positive 
cells are then used to produce hybridomas using standard techniques. About 20 mg of peptide is 
sufficient for labeling and screening several thousand clones. Hybridomas of interest are detected by 
screening with radioiodinated peptide to identity those fusions producing peptide-specific monoclonal 
antibody. In a typical protocol, wells of a multi-well plate (FAST, Becton-Dickinson, Palo Alto, CA) 

20 are coated with affinity-purified, specific rabbit-anti-mouse (or suitable anti-species IgG) antibodies at 
10 mg/ml. The coated wells are blocked with 1% BSA and washed and exposed to supernatants from 
hybridomas. After incubation, the wells are exposed to radiolabeled peptide at 1 mg/ml. 

Clones producing antibodies bind a quantity of labeled peptide that is detectable above 

i 

background. Such clones are expanded and subjected to 2 cycles of cloning. Cloned hybridomas are 

2 5 injected into pristane-treated mice to produce ascites, and monoclonal antibody is purified from the 

ascitic fluid by affinity chromatography on protein A (Amersham Pharmacia Biotech). Several 
procedures for the production of monoclonal antibodies, including in vitro production, are described in 
Pound (supra) . Monoclonal antibodies with antipeptide activity are tested for anti-DITHP activity 
using protocols well known in the art, including ELISA, RIA, and immunoblotting. 

3 o Antibody fragments containing specific binding sites for an epitope may also be generated For 

example, such fragments include, but are not limited to, the F(ab*)2 fragments produced by pepsin 
digestion of the antibody molecule, and the Fab fragments generated by reducing the disulfide bridges of 
the F(ab')2 fragments. Alternatively, construction of Fab expression libraries in filamentous 
bacteriophage allows rapid and^asy identification of monoclonal fragments with desired specificity 
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(Pound, supra , Chaps. 45-47). Antibodies generated against polypeptide encoded by dithp can be used 
to purify and characterize full-length DITHP protein and its activity, binding partners, etc. 

Assays Using Antibodies 

5 Anti-DITHP antibodies may be used in assays to quantify the amount of DITHP found in a 

particular human cell. Such assays include methods utilizing the antibody and a label to detect 
expression level under normal or disease conditions. Hie peptides and antibodies of the invention may 
be used with or without modification or labeled by joining them, either covalently or noncovalently, 
with a reporter molecule. 

10 Protocols for detecting and measuring protein expression using either polyclonal or monoclonal 

antibodies are well known in the art Examples include ELIS A, RIA, and fluorescent activated cell 
sorting (FACS). Such immunoassays typically involve the formation of complexes between the DITHP 
and its specific antibody and the measurement of such complexes. These and other assays are described 
in Pound (supra) . 

1 5 Without further elaboration, it is believed that one skilled in the art can, using the preceding 

description, utilize the present invention to its fullest extent The following preferred specific 
embodiments are, therefore, to be construed as merely illustrative, and not limitative of the remainder of 
the disclosure in any way whatsoever. 

The disclosures of all patents, applications, and publications mentioned above and below, 

20 including U.S. Ser. No. 60/184,777, U.S. Ser. No. 60/184,797, U.S. Ser. No. 60/184,698, U.S. Ser. 
No. 60/184,770, U.S. Ser. No. 60/184,774, U.S. Ser. No. 60/184,693, U.S. Ser. No. 60/184,771,U.S. 
Ser. No. 60/184,813, U.S. Ser. No. 60/184,773, U.S. Ser. No. 60/184,776, U.S. Ser. No. 60/184,769, 
U.S. Ser. No. 60/184,768, U.S. Ser. No. 60/184,837, U.S. Ser. No. 60/184,697, U.S. Ser. No. 
60/184,841, U.S. Ser. No. 60/184,772, U.S. Ser. No. 60/185,213, U.S. Ser. No. 60/185,216, U.S. Ser. 

25 No. 60/204,863, U.S. Ser. No. 60/205,221, U.S. Ser. No. 60/204,815, U.S. Ser. No. 60/203,785, U.S. 
So". No. 60/204,821, U.S. Ser. No. 60/204,908, U.S. Sear. No. 60/204,226, U.S. Ser. No. 60/204,525, 
U.S. Ser. No. 60/205,285, U.S. Ser. No. 60/205,232, U.S. Ser. No. 60/205,323, U.S. Ser. No. 
60/205,287, U.S. Ser. No. 60/205,324, and U.S. Sex. No. 60/205,286, are hereby expressly 
incorporated by reference. 

30 

EXAMPLES 

I. Construction of cDNA Libraries 

RNA was purchased from CLONTECH Laboratories, Inc. (Palo Alto CA) or isolated from 
various tissues. Some tissues were homogenized and lysed in guanidinium isothiocyanate, while others 
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were homogenized and lysed in phenol or in a suitable mixture of denaturants, such as TRIZOL (Life 
Technologies), a monophasic solution of phenol and guanidine isothiocyanate. The resulting lysates 
were centrifuged over CsCl cushions or extracted with chloroform. RNA was precipitated with either 
isopropanol or sodium acetate and ethanol, or by other routine methods. 
5 Phenol extraction and precipitation of RNA were repeated as necessary to increase RNA 

purity. In most cases, RNA was treated with DNase. For most libraries, poly(A+) RNA was isolated 
using oligo d(T)-coupled paramagnetic particles (Proraega Corporation (Promega), Madison WI), ' 
OLIGOTEX latex particles (QIAGEN, Inc. (QIAGEN), Valencia CA), or an OLIGOTEX mRNA 
purification kit (QIAGEN). Alternatively, RNA was isolated directly from tissue lysates using other 

10 RNA isolation kits, e.g., the POLY(A)PURE mRNA purification kit (Ambion, Inc., Austin TX). 

In some cases, Stratagene was provided with RNA and constructed the corresponding cDNA 
libraries. Otherwise, cDNA was synthesized and cDNA libraries were constructed with the UNIZAP 
vector system (Stratagene Cloning Systems, Inc. (Stratagene), La Jolla CA) or SUPERSCRIPT 
plasmid system (Life Technologies), using the recommended procedures or similar methods known in 

is the art (See, e.g., Ausubel, 1997, supra . Chapters 5.1 through 6.6.) Reverse transcription was 

initiated using oligo d(T) or random primers. Synthetic oligonucleotide adapters were ligated to double 
stranded cDNA, and the cDNA was digested with the appropriate restriction enzyme or enzymes'. For 
most libraries, the pDNA was size-selected (300-1000 bp) using SEPHACRYL S 1000, SEPHAROSE 
CL2B, or SEPHAROSE CL4B column chromatography (Amersham Pharmacia Biotech) or 

2 o preparative agarose gel electrophoresis. cDNAs were ligated into compatible restriction enzyme sites of 
thepolylinker of a suitable plasmid, e.g., PBLUESCRIPT plasmid (Stratagene), PSPORT1 plasmid 
(Life Technologies), PCDNA2.1 plasmid (Invitrogen, Carlsbad CA), PBK-CMV plasmid (Stratagene), 
or pINCY (Incyte Genomics, Palo Alto CA), or derivatives thereof. Recombinant plasmids were 
transformed into competent E. coli cells including XLl-Blue, XLl-BlueMRF, or SOLR from 

25 Stratagene or DH5ct, DH10B, or ElectroMAX DH10B from Life Technologies. 



II. Isolation of cDNA Clones 

Plasmids were recovered from host cells by in vivo excision using the UNIZAP vector system 
(Stratagene) or by cell lysis. Plasmids were purified using at least one of the following: the Magic or 
3 o WIZARD Minipreps DNA purification system (Promega); the AGTC Miniprep purification kit {Edge 
BioSystems, Gaithersburg MD); and the QIAWELL 8, QIAWELL 8 Plus, and QIAWELL 8 Ultra 
plasmid purification systems or the R.E.A.L. PREP 96 plasmid purification kit (QIAGEN). Following 
precipitation, plasmids were resuspended in 0.1 ml of distilled water and stored, with or without 
lyqphilization, at 4°C. 
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Alternatively, plasmid DNA was amplified from host cell lysates using direct link PCR in a 
high-throughput format (Rao, V.B. (1994) Anal. Biochem: 216:1-14.) Host cdi lysis and thermal 
cycling steps wore carried out in a single reaction mixture. Samples were processed and stored in 384- 
well plates, and the concentration of amplified plasmid DNA was quantified fluorometrically using 
PICOGREEN dye (Molecular Probes, Inc. (Molecular Probes), Eugene OR) and a FLUOROSKAN II 
fluorescence scanner (Labsystems Oy, Helsinki, Finland). 

III. Sequencing and Analysis 
cDNA sequencing reactions were processed using standard methods or high-througbput 

instrumentation such as the ABI CATALYST 800 thermal cycler (Applied Biosystems) or the PTC- 
200 thermal cycler (MJ Research) in conjunction with the HYDRA microdispenser (Robbins Scientific 
Corp., Sunnyvale CA) or the MICROLAB 2200 liquid transfer system (Hamilton), cDNA sequencing 
reactions were prepared using reagents provided by Amersham Pharmacia Biotech or supplied in ABI 
sequencing kits such as the ABI PRISM BIGDYE Terminator cycle sequencing ready reaction kit 
(Applied Biosystems). Electrophoretic separation of cDNA sequencing reactions and detection of 
labeled polynucleotides ware carried out using the MEGABACE 1000 DNA sequencing system 
(Molecular Dynamics); the ABI PRISM 373 or 377 sequencing system (Applied Biosystems) in 
conjunction with standard ABI protocols and base calling software; or other sequence analysis systems 
known in the art Reading frames within the cDN A sequences wore identified using standard methods 
(reviewed in Ausubd, 1997, supra . Chapter 7.7). Some of the cDNA sequences were selected for 
extension using the techniques disclosed in Example VOL 

IV. Assembly and Analysis of Sequences 
Component sequences from chromatograms were subject to PHRED analysis and assigned a 

2 5 quality score. The sequences having at least a required quality score ware subject to various pre- 
processing editing pathways to eliminate, e.g., low quality 3' ends, vector and linker sequences, polyA 
tails, Alu repeats, mitochondrial and ribosomal sequences, bacterial contamination sequences, and - 
sequences smaller than 50 base pairs. In particular, low-information sequences and repetitive elements 
(e.g., dinucleotide repeats, Alu repeats, etc.) ware replaced by "n's", or masked, to prevent spurious 

30 matches. 

Processed sequences were then subject to assembly procedures in which the sequences were 
assigned to gene bins (bins). Each sequence could only belong to one bin. Sequences in each gene bin 
ware assembled to produce consensus sequences (templates). Subsequent new sequences were added to 
existing bins using BLASTn (v. 1 .4 WashU) and CROSSMATCH. Candidate pairs were identified as 
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all BLAST hits having a quality score greater than or equal to 150. Alignments of at least 82% local 
identity were accepted into the bin. The component sequences from each bin were assembled using a 
version of PHRAP. Bins with several overlapping component sequences were assembled using DEEP 
PHRAP. The orientation (sense or antisense) of each assembled template was determined based on the 
5 number and orientation of its component sequences. Template sequences as disclosed in the sequence 
listing correspond to sense strand sequences (the "forward" reading frames), to the best determination. 
The complementary (antisense) strands are inherently disclosed herein. The component sequences 
which were used to assemble each template consensus sequence are listed in Table 4, along with then- 
positions along the template nucleotide sequences. 
10 Bins were compared against each other and those having local similarity of at least 82% were 

combined and reassembled. Reassembled bins having templates of insufficient overlap (less than 95% 

local identity) were re-split Assembled templates were also subject to analysis by STITCHER/EXON 

i 

MAPPER algorithms which analyze the probabilities of the presence of splice variants, alternatively 
spliced exons, splice junctions, differential expression of alternative spliced genes across tissue types or 
15 disease states, etc. These resulting bins were subject to several rounds of the above assembly 
procedures. 

Once gene bins were generated based upon sequence alignments, bins were clone joined based 
upon clone information. If the 5 f sequence of one clone was present in one bin and the 3' sequence from 
the same clone was present in a different bin, it was likely that the two bins actually belonged together 
20 in a single bin. The resulting combined bins underwent assembly procedures to regenerate the 
consensus sequences. 

The final assembled templates were subsequently annotated using the following procedure. 
Template sequences were analyzed using BLASTn (v2.0, NCBI) versus gbpri (GenBank version 120). 
"Hits" were defined as an exact match having from 95% local identity over 200 base pairs through 

2 5 1 00% local identity over 1 00 base pairs, or a homolog match having an E-value, i.a a probability 

score, of <; 1 x 10"*. The hits were subject to frameshift FASTx versus GENPEPT (GenBank version 
1 20). (See Table 7). In this analysis, a homolog match was defined as having an E-value of <s 1 x ltf 8 . 
The assembly method used above was described in "System and Methods for Analyzing Biomolecular 
Sequences," U.S.S.N. 09/276,534, filed March 25, 1999, and the LDFESEQ Gold user manual (Incyte) 

3 o both incorporated by reference herein. 

Following assembly, template sequences were subjected to motif, BLAST, and functional 
analyses, and categorized in protein hierarchies using methods described in, e.g„ "Database System 
Employing Protein Function Hierarchies for Viewing Biomolecular Sequence Data," U.S.S.N. 
08/812,290, filed March 6, 1997; "Relational Database for Storing Biomolecule Information," 
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U.S.S.N. 08/947,845, filed October 9, 1997; "Project-Based Full-Length Biomolecular Sequence 
Database," U.S.S.N. 08/811,758, filed March 6, 1997; and "Relational Database and System for 
Storing Information Relating to Biomolecular Sequences" U.S.S.N. 09/034,807, filed March 4, 1998, 
all of which are incorporated by reference herein. 
5 The template sequences' were further analyzed by translating each template in all three forward 

reading frames and searching each translation against the Pf am database of hidden Markov model- . . 
i based protein families and domains using the HMMER software package (available to the public from 

Washington University School of Medicine, St. Louis MO). Regions of templates which, when 
translated, contain similarity to Pfam consensus sequences are reported in Table 2, along with 

10 descriptions of Pfam protein domains and families. Only those Pfamhits with anE-value of *1 x 10" 3 
are reported. (See also World Wide Web site http://pfam.wustl.edu/ for detailed descriptions of Pfam 
protein domains and families.) 

Additionally, the template sequences ware translated in all three forward reading frames, and 
each translation was searched against hidden Markov models for signal peptides using the HMMER 

1 1 5 software package. Construction of hidden Markov models and their usage in sequence analysis has 
been described (See, for example, Eddy, S.R. (1996) Curr. Opin. Str. Biol. 6:361-365.) Only those 
signal peptide hits with a cutoff score of 1 1 bits or greater are reported A cutoff score of 1 1 bits or 
greater corresponds to at least about 91-94% true-positives in signal peptide prediction. Template 
sequences were also translated in all three forward reading frames, and each translation was searched* 

20 against TMAP, a program that uses weight matrices to delineate transmembrane segments on protean 
sequences and determine orientation, with respect to the cell cytosol (Persson, B. and P. Argos (1994) J. 
Mol. Biol. 237:182-192; Persson, B. and P. Argos (1996) Protein Sci. 5:363-371). Regions of 
templates which, when translated, contain similarity to signal peptide or transmembrane consensus 
sequences are reported in Table 3. 

2 5 The results of HMMER analysis as reported in Tables 2 and 3 may support the results of 

BLAST analysis as reported in Table 1 or may suggest alternative or additional properties of template- 
encoded polypeptides not previously uncovered by BLAST or other analyses. 

Template sequences are further analyzed using the bioinformatics tools listed in Table 7, or 
using sequence analysis software known in the art such as MACDNASIS PRO software (Hitachi 

3 o Software Engineering, South San Francisco CA) and LASERGENE software (DNASTAR). Template 

sequences may be farther queried against public databases such as the GenBank rodent, mammalian, 
vertebrate* prokaryote, and eukaryote databases. 

The template sequences were translated to derive the corresponding longest open reading frame 
as presented by the polypeptide sequences. Alternatively, a polypeptide of the invention may begin at 
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any of the methionine residues within the full length translated polypeptide. Polypeptide sequences 
were subsequently analyzed by querying against the GenBank protein database (GENPEPT, (GenBank 
version 121)). Full length polynucleotide sequences are also analyzed using MACDNASIS PRO 
software (Hitachi Software Engineering, South San Francisco CA) and LASERGENE software 
5 (DNASTAR). Polynucleotide and polypeptide sequence alignments are generated using default 

parameters specified by the CLUSTAL algorithm as incorporated into the MEGALIGN multisequence 
alignment program (DNASTAR), which also calculates the percent identity between aligned sequences. 

Table 6 shows sequences with homology to the polypeptides of the invention as identified by 
BLAST analysis against the GenBank protein (GENPEPT) database. Column 1 shows the polypeptide 

l o sequence identification number (SEQ ID NO:) for the polypeptide segments of the inventioa Column 2 
shows the reading frame used in the translation of the polynucleotide' sequences encoding the 
polypeptide segments. Column 3 shows the length of the translated polypeptide segments. Columns 4 
and 5 show the start and stop nucleotide positions of the polynucleotide sequences encoding the 
polypeptide segments. Column 6 shows the GenBank identification number (GI Number) of the nearest 

is GenBank homolog. Column 7 shows the probability score for the match between each polypeptide and 
its GenBank homolog. Column 8 shows the annotation of the GenBank homolog. 
V. Analysis of Polynucleotide Expression 

Northern analysis is a laboratory technique used to detect the presence of a transcript of a gene 
and involves the hybridization of a labeled nucleotide sequence to a membrane on which RNAs from a 

20 particular cell type or tissue have been bound. (See, e.g., Sambrook, supra , ch. 7; Ausubel, 1995, 
supra , ch. 4 and 16.) 

Analogous computer techniques applying BLAST were used to search for identical or related 
molecules in cDNA databases such as GenBank or LIFESEQ (Incyte Genomics). This analysis is 
much faster than multiple membrane-based hybridizations. In addition, the sensitivity of the computer 
25 search can be modified to determine whether any particular match is categorized as exact or similar. 
The basis of the search is the product score, which is defined as: 

BLAST Score x Percent Identity 

5 x minimum {length(Seq. 1), length(Seq. 2)} 

30 

The product score takes into account both the degree of similarity between two sequences and the length 
of the sequence match. The product score is a normalized value between 0 and 100, and is calculated 
as follows: the BLAST score is multiplied by the percent nucleotide identity and the product is divided 
by (5 times the length of the shorter of the two sequences). The BLAST score is calculated by 
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assigning a score of +5 for every base that matches in a high-scoring segment pair (HSP), and -4 for 
every mismatch. Two sequences may share more than one HSP (separated by gaps). If there is more 
than one HSP, then the pair with the highestBLAST score is used to calculate the product score. The 
product score represents a balance between fractional overlap and quality in a BLAST alignment For 
5 example, a product score of 100 is produced only for 100% identity over the entire length of the shorter 
of the two sequences being compared A product score of 70 is produced either by 100% identity and 
70% overlap at one end, or by 88% identity and 100% overlap at the other. A product score of 50 is 
produced either by 100% identity and 50% overlap at one end, or 79% identity and 100% overlap: 

10 VI. Tissue Distribution Profiling 1 

A tissue distribution profile is determined for each template by compiling the cDNA library 
tissue classifications of its component cDNA sequences. Each component sequence, is derived from a 
cDNA library constructed from a human tissue. Each human tissue is classified into one of the 
following categories: cardiovascular system; connective tissue; digestive system; embryonic structures; 
is endocrine system; exocrine glands; genitalia, female; genitalia, male; germ cells; hemic and immune 
system; liver; musculoskeletal system; nervous system; pancreas; respiratory system; sense organs; 
skin; stomatognathic system; unclassified/mixed; or urinary tract Template sequences, component 
sequences, and cDNA library/tissue information are found in the LIFESEQ GOLD database (Incyte 
Genomics, Palo Alto CA). 

2 o Table 5 shows the tissue distribution profile for the templates of the inventioa For each 

template, the three most frequently observed tissue categories are shown in column 3, along with the 
percentage of component sequences belonging to each category. Only tissue categories with 
percentage values of * 10% are shown. A tissue distribution of "widely distributed" in column 3 
indicates percentage values of <10% in all tissue categories. 

25 

VII. Transcript Image Analysis 

Transcript images are generated as described in Seilhamer et al„ "Comparative Gene 
Transcript Analysis/' U.S. Patent Number 5,840,484, incorporated herein by reference. 

3 o VIE. Extension of Polynucleotide Sequences and Isolation of a Full-length cDNA 

Oligonucleotide primers designed using a dithp of the Sequence Listing are used to extend the 
nucleic acid sequence. One primer is synthesized to initiate 5' extension of the template, and the other 
primer, to initiate 3* extension of the template. The initial primers may be designed using OLIGO 4.06 
software (National Biosciences, Inc. (National Biosciences), Plymouth MN), or another appropriate 
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program, to be about 22 to 30 nucleoticies in length, to have a GC content of about 50% or more, and to 
anneal to the target sequence at temperatures of about 68 °C to about 72°C. Any stretch of nucleotides 
which would result in hairpin structures and primer-primer dimerizations are avoided. Selected human 
cDNA libraries are used to extend the sequence. If more than one extension is necessary or desired, 
5 additional or nested sets of primers are designed. 

High fidelity amplification is obtained by PCR using methods well known in the art PCR is 
performed in 96-well plates using the PTC-200 thermal cycler (MJ Research). The reaction mix 
contains DNA template, 200 nmol of each prima*, reaction buffer containing Mg 2 *, (NH 4 ) 2 S0 4 , and B- 
mercaptoethanol, Taq DNA polymerase (Amersham Pharmacia Biotech), ELONGASB enzyme (Life 

10 Technologies), and Pfu DNA polymerase (Stratagene), with the following parameters for primer pair 
PCI A and PCI B: Step 1: 94°C, 3 min; Step 2: 94°C, 15 sec; Step 3: 60°C, 1 min; Step 4: 68°C, 2 
min; Step 5: Steps 2, 3, and 4 repeated 20 times; Step 6: 68 ? C, 5 min; Step 7: storage at 4°C. In the 
alternative, the parameters for primer pair T7 and SK+ are as follows: Step 1: 94°C, 3 min; Step 2: 
94°C, 15 sec; Step 3: 57°C, 1 min; Step 4: 68 °C, 2 min; Step 5: Steps 2, 3, and 4 repeated 20 tinies; 

15 Step 6: 68°C, 5 min; Step 7: storage at 4°C. 

The concentration of DNA in each well is determined by dispensing 100 jil PICOGREEN 
quantitation reagent (0.25% (v/v); Molecular Probes) dissolved in IX Tris-EDTA (TE) and 0.5 \il of 
undiluted PCR product into each well of an opaque fluorimeter plate (Corning Incorporated (Corning), 
Corning NY), allowing the DNA to bind to the reagent. The plate is scanned in a FLUOROSKAN II 

20 (Labsystems Oy) to measure the fluorescence of the sample and to quantify the concentration of DNA. 
A 5 |ul to 10 ^1 aliquot of the reaction mixture is analyzed by electrophoresis on a 1 % agarose mini-gel 
to determine which reactions are successful in extending the sequence. 

The extended nucleotides are desalted and concentrated, transferred to 3 84- well plates, digested 
with CviH cholera virus endonuclease (Molecular Biology Research, Madison WI), and sonicated or 

25 sheared prior to religation into pUC 18 vector (Amersham Pharmacia Biotech). For shotgun 

sequencing, the digested nucleotides are separated on low concentration (0.6 to 0.8%) agarose gels, 
fragments are excised, and agar digested with AGAR ACE (Promega). Extended clones are religated 
using T4 ligase (New England Biolabs, Inc., Beverly MA) into pUC 18 vector (Amersham Pharmacia 
Biotech), treated with Pfu DNA polymerase (Stratagene) to fill-in restriction site overhangs, and 

30 transfected into competent E. coli cells. Transformed cells are selected on antibiotic-containing media, 
individual colonies are picked and cultured overnight at 37°C in 384-well plates in LB/2x carbenicillin 
liquid media. 

The cells are lysed, and DNA is amplified by PCR using Taq DNA polymerase (Amersham 
Pharmacia Biotech) and Pfu DNA polymerase (Stratagene) with the following parameters: Step 1: 
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94°C, 3 min; Step 2: 94°C, 15 sec; Step 3: 60°C, 1 min; Step 4: 72°C, i min; Step 5: steps 2, 3, and 4 
repeated 29 times; Step 6: 72°C, 5 min; Step 7: storage at 4°C. DNA is quantified by PICOGREEN 
reagent (Molecular Probes) as described above. Samples with low DNA recoveries are reamplified 
using the same conditions as described above. Samples are diluted with 20% dimethysulfoxide (1:2, 
5 v/v), and sequenced using DYENAMIC energy transfer sequencing primers and the DYENAMIC 
DIRECT kit (Amersham Pharmacia Biotech) or the ABI PRISM BIGDYE Terminator cycle 
sequencing ready reaction kit (Applied Biosystems). \ 

In like manner, the dithp is used to obtain regulatory sequences (promotes, introns, and 
enhancers) using the procedure above, oligonucleotides designed for such extension, and an appropriate 
10 genomic library. 

EX. Labeling of Probes and Southern Hybridization Analyses 

Hybridization probes derived from the dithp of the Sequence Listing are employed for screening 
cDNAs, mRNAs, or genomic DNA. The labeling of probe nucleotides between 100 and 1000 

as nucleotides in length is specifically described, but, essentially the same procedure may be used with 
largo: cDNA fragments. Probe sequences are labeled at room temperature for 30 minutes using a T4 
polynucleotide kinase, y^-ATP, and 0.5X One-Phor-All Plus (Amersham Pharmacia Biotech) buffer 
and purified using a ProbeQuant G-50 Microcolumn (Amersham Pharmacia Biotech). The probe 
mixture is diluted to 1 0 7 dpm/jig/ml hybridization buffer and used in a typical membrane-based 

20 hybridization analysis. 

The DNA is digested with a restriction endonuclease such as Eco RV and is dectrophoresed 
through a 0.7% agarose gel. The DNA fragments are transferred from the agarose to nylon membrane 
(NYTRAN Plus, Schleicher & Schuell, Inc., Keene NH) using procedures specified by the 
manufacturer of the membrane. Prehybridization is carried out for three or more hours at 68 °C, and 

25 hybridization is carried out overnight at 68°C. To remove non-specific signals, blots are sequentially 
washed at room temperature under increasingly stringent conditions, up to 0. lx saline sodium citrate 
(SSC) and 0.5% sodium dodecyl sulfate. After the blots are placed in a PHOSPHORIMAGER cassette 
(Molecular Dynamics) or are exposed to autoradiography film, hybridization patterns of standard and 
experimental lanes are compared. Essentially the same procedure is employed when screening RNA 

30 

X. Chromosome Mapping of dithp 

The cDNA sequences which ware used to assemble SEQ ID NO:l-211 are compared with 
sequences from the Incyte LDFESEQ database and public domain databases using BLAST and other 
implementations of the Smith-Waterman algorithm. Sequences from these databases that match SEQ 
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ID NO:l-211 are assembled into clusters of contiguous and overlapping sequences using assembly 
algorithms such as PHRAP (Table 7). Radiation hybrid and genetic mapping data available from 
public resources such as the Stanford Human Genome Center (SHGC), Whitehead Institute for Genome 
Research (WIGR), and G6n6thon are used to determine if any of the clustered sequences have been 
5 previously mapped Inclusion of a mapped sequence in a cluster will result in the assignment of all , 
sequences of that cluster, including its particular SEQ ID NO:, to that map location. The genetic map 
locations of SEQ ID NO:l-21 1 are described as ranges, or intervals, of human chromosomes. The map 
position of an interval, in centiMorgans, is measured relative to the terminus of the chromosome's p- 
arm. (The centiMorgan (cM) is a unit of measurement based on recombination frequencies between 
10 chromosomal markers. On average, 1 cM is roughly equivalent to 1 megabase (Mb) of DNA in 

humans, although this can vary widely due to hot and cold spots of recombination.) The cM distances 
are based on genetic markers mapped by G6n6thon which provide boundaries for radiation hybrid 
markers whose sequences were included in each of the clusters. 

is XI. Microarray Analysis 

Probe Preparation from Tissue or Cell Samples 

Total RNA is isolated from tissue samples using the guanidinium thiocyanate method and 
polyA* RNA is purified using the oligo ((TT) cellulose method. Each polyA + RNA sample is reverse 
transcribed using MMLV reverse-transcriptase, 0.05 pg/|il oligo-dT primer (21mer), IX first strand 

2 o buffer, 0.03 units/^1 RNase inhibitor, 500 fiM dATP, 500 ^iM dGTP, 500 nM dTTP, 40 dCTP, 
40 |iM dCTP-Cy3 (BDS) or dCTP-Cy5 (Amersham Pharmacia Biotech). The reverse transcription 
reaction is performed in a 25 ml volume containing 200 ng polyA + RNA with GEMB RIGHT kits 
(Incyte). Specific control polyA* RNAs are synthesized by in vitro transcription from non-coding yeast 
genomic DNA (W. Lea, unpublished). As quantitative controls, the control mRNAs at 0.002 ng, 0.02 

25 ng, 0.2 ng, and 2 ng are diluted into reverse transcription reaction at ratios of 1 :100,000, 1 :10,000, 
1 : 1000, 1 :100 (w/w) to sample mRNA respectively. The control mRNAs are diluted into reverse 
transcription reaction at ratios of 1:3, 3:1, 1:10, 10:1, 1:25, 25:1 (w/w) to sample mRNA differential 
expression patterns. After incubation at 37° C for 2 hr, each reaction sample (one with Cy3 and another 
with Cy5 labeling) is treated with 2.5 ml of 0.5M sodium hydroxide and incubated for 20 minutes at 

30 85° C to the stop the reaction and degrade the RNA Probes are purified using two successive 

CHROMA SPIN 30 gel filtration spin columns (CLONTECH Laboratories, Inc. (CLONTECH), Palo 
Alto CA) and after combining, both reaction samples are ethanol precipitated using 1 ml of glycogen (1 
mg/iril), 60 ml sodium acetate, and 300 ml of 100% ethanol. The probe is then dried to completion 
using a SpeedVAC (Savant Instruments Inc., Holbrook NY) and resuspended in 14 yil 5X SSC/0.2% 
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SDS. ' 

Microarrav Preparation 

Sequences of the present invention are used to generate array elements. Each array element is 
5 amplified from bacterial cells containing vectors with cloned cDNA inserts. PCR amplification uses 
primers complementary to the vector sequences flanking the cDNA insert Array dements are 
amplified in thirty cycles of PCR from an initial quantity of 1 -2 ng to a final quantity greater than 5 ng. 
Amplified array elements are then purified using SEPHACRYL-400 (Amersham Pharmacia Biotech). 

Purified array elements are immobilized on polymer-coated glass slides. Glass microscope 

1 o slides (Corning) are cleaned by ultrasound in 0. 1 % SDS and acetone, with extensive distilled water 

washes between and after treatments. Glass slides are etched in 4% hydrofluoric acid (VWR Scientific 

Products Corporation (VWR), West Chester, PA), washed extensively in distilled water, and coated 

i 

with 0.05% aminopropyl silane (Sigma) in 95% ethanol. Coated slides are cured in a 1 10°C oven. 

Array elements are applied to the coated glass substrate using a procedure described in US 
15 Patent No. 5,807,522, incorporated herein by reference. 1 jil of the array element DNA, at an average 
concentration of 100 ng/nl, is loaded into the open capillary printing element by a high-speed robotic 
apparatus. The apparatus then deposits about 5 nl of array element sample per slide* 

Microarrays are UV-crosslinked using a STRATALINKER UV-crosslinker (Stratagene). 
Microarrays are washed at room temperature once in 0.2% SDS and three times in distilled water. 

2 o Non-specific binding sites are blocked by incubation of microarrays in 0.2% casein in phosphate 

buffered saline (PBS) (Tropix, Inc., Bedford, MA) for 30 minutes at 60° C followed by washes in 0.2% 
SDS and distilled water as before. 

Hybridization 

2 s Hybridization reactions contain 9 jxl of probe mixture consisting of 0.2 ^ig each of Cy3 and 

Cy5 labeled cDNA synthesis products in 5X SSC, 0.2% SDS hybridization buffer. The probe mixture 
is heated to 65° C for 5 minutes and is aliquoted onto the microarray surface and covered with an 1 .8 
cm 2 coverslip. The arrays are transferred to a waterproof chamber having a cavity just slightly largo: 
than a microscope slide. The chamber is kept at 100% humidity internally by the addition of 140 |il of 

30 5x SSC in a corner of the chamber. The chamber containing the arrays is incubated for about 6.5 

hours at 60°C. The arrays are washed for 10 min at 45°C in a first wash buffer (IX SSC, 0.1% SDS), 
three times for 10 minutes each at 45° C in a second wash buffer (0.1X SSC), and dried. 

Detection 
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Reporter-labeled hybridization complexes are detected with a microscope equipped with an 
Innova 70 mixed gas 10 W laser (Coherent, Inc., Santa Clara CA) capable of generating spectral lines 
at 488 nm for excitation of Cy3 and at 632 rim for excitation of Cy5. The excitation laser light is 
focused on the array using a 20X microscope objective (Nikon, Inc., Melville NY). The slide 
5 containing the array is placed on a computer-controlled X-Y stage on the microscope and raster- 
scanned past the objective. The 1 .8 cm x 1 .8 cm array used in the present example is scanned with a 
resolution of 20 micrometers. 

In two separate scans, a mixed gas multiline laser excites the two fluorophores sequentially. 
Emitted light is split, based on wavelength, into two photomultiplier tube detectors (PMT R1477, 
10 Hamamatsu Photonics Systems, Bridgewater NJ) corresponding to the two fluorophores. Appropriate 
filters positioned between the array and the photomultiplier tubes are used to filter the signals. The 
emission maxima of the fluorophores used are 565 nm for Cy3 and 650 nm for Cy5. Each array is 
typically scanned twice, one scan per fluorophore using the appropriate filters at the laser source, 

1 although the apparatus is capable of recording the spectra from both fluorophores simultaneously; 
15 The sensitivity of the scans is typically calibrated using the signal intensity generated by a 

cDN A control species added to the probe mix at a known concentration. A specific location on the 
array contains a complementary DNA sequence, allowing the intensity of the signal at that location to 
be correlated with a weight ratio of hybridizing species of 1:100,000. When two probes from different 
sources (e.g„ representing test and control cells), each labeled with a different fluorophore, are 

2 o hybridized to a single array for the purpose of identifying genes that are differentially expressed/ the 

calibration is done by labeling samples of the calibrating cDNA with the two fluorophores and adding 
identical amounts of each to the hybridization mixture. 

The output of the photomultiplier tube is digitized using a 12-bit RTI-835H analog-to-digital 
(A/D) conversion board (Analog Devices, Inc., Norwood, MA) installed in an IBM-compatible PC 

2 5 computer. The digitized data are displayed as an image where the signal intensity is mapped using a 
linear 20-color transformation to a pseudocolor scale ranging from blue (low signal) to red (high 
signal). The data is also analyzed quantitatively. Where two different fluorophores are excited and 
measured simultaneously, the data are first corrected for optical crosstalk (due to overlapping emission 
spectra) between the fluorophores using each fluorophore's emission spectrum. 

30 A grid is superimposed over the fluorescence signal image such that the signal from each spot 

is centered in each element of the grid. The fluorescence signal within each element is then integrated to 
obtain a numerical value corresponding to the average intensity of the signal. The software used for 
signal analysis is the GEMTOOLS gene expression analysis program (Incyte). 
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XII. Complementary Nucleic Acids 

Sequences complementary to the dithp are used to detect, decrease* or inhibit expression of the 
naturally occurring nucleotide. The use of oligonucleotides comprising from about 15 to 30 base pairs 
is typical in theart However, smaller or largo: sequence fragments can also be used Appropriate 
oligonucleotides are designed from the dithp using OLIGO 4.06 software (National Biosciences) or 
other appropriate programs and are synthesized using methods standard in the art or ordered from a 

commercial supplier. To inhibit transcription, a complementary oligonucleotide is designed from the 

i 

most unique 5' sequence and used to prevent transcription factor binding to the promoter sequence." To 
inhibit translation, a complementary oligonucleotide is designed to prevent ribosomal binding and 
processing of the transcript 1 

XIIL Expression of DITHP 

Expression and purification of DITHP is accomplished using bacterial or virus-based 
expression systems. For expression of DITHP in bacteria, cDNA is subcloned into an appropriate 
vector containing an antibiotic resistance gene and an inducible promoter that directs high levels of 
cDNA transcriptiorL Examples of such promoters include, but are not limited to, the trpAac (tac) 
hybrid promote and the T5 or T7 bacteriophage promoter in conjunction with the lac operator 
regulatory element Recombinant vectors are transformed into suitable bacterial hosts, e.g., 
BL21 (DE3). Antibicrtic resistant bacteria express DITHP upon induction with isopropyl beta-D- 
thiogalactopyranoside (IPTG). Expression of DITHP in eukaryotic cells is achieved by infecting insect 
or mammalian cell lines with recombinant Autographica californica nuclear polyhedrosis virus 
(AcMNPV), commonly known as baculovirus. The nonessential polyhedrin gene of baculovirus is 
replaced with cDNA encoding DITHP by either homologous recombination or bacterial-mediated 
transposition involving transfer plasmid intermediates. Viral infectivity is maintained and the strong 
polyhedrin promoter drives high levels of cDNA transcription. Recombinant baculovirus is used to 
infect Spodoptera frugiperda (Sf9) insect cells in most cases, or human hepatocytes, in some cases. 
Infection of the latter requires additional genetic modifications to baculovirus. (See e.g., Engelhard, 
supra : and Sandig, supra .) 

In most expression systems, DITHP is synthesized as a fusion protein with, e.g., glutathione S- 
transferase (GST) or a peptide epitope tag, such as FLAG or 6-His, permitting rapid, single-step, 
affinity-based purification of recombinant fusion protein from crude cell lysates. GST, a 26-kilodalton 
enzyme from Schistosoma iaponicum . enables the purification of fiision proteins on immobilized 
glutathione under conditions that maintain protein activity and antigenicity (Amersham Pharmacia 
Biotech). Following purification, the GST moiety can be proteolytically cleaved from DITHP at 
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specifically engineered sites. FLAG, an 8-amino acid peptide, enables immunoaffinity purification 
using commercially available monoclonal and polyclonal anti-FLAG antibodies (Eastman Kodak 
Company, Rochester NY). 6-His, a stretch of six consecutive histidine residues, enables purification on 
metal-chelate resins (QIAGEN). Methods for protein expression and purification are discussed in 
5 Ausubel (1995, supra , Chapters 10 and 16). Purified DITHP obtained by these methods can be used 
directly in the following activity assay. 

XIV. Demonstration of DITHP Activity 

DITHP activity is demonstrated through a variety of specific assays, some of which are 
10 outlined below. 

Oxidoreductase activity of DITHP is measured by the increase in extinction coefficient of 
NAD(P)H coenzyme at 340 nmfor the measurement of oxidation activity, or the decrease in extinction 
coefficient of NAD(P)H coenzyme at 340 nmfor the measurement of reduction activity (Dalziel, K. 
(1963) J. Biol. Chem. 238:2850-2858). One of three substrates may be used: Asn-pGal, biocytidine, or 

is ubiquinone-10. The respective subunits of the enzyme reaction, for example, cytochtome Cj-b 
oxidoreductase and cytochrome c, are reconstituted. The reaction mixture contains a)l-2 mg/ml 
DITHP; and b) 15 mM substrate, 2.4 mM NAD(P) + in 0.1 M phosphate buffer, pH 7.1 (oxidation 
reaction), or 2.0 mM ,NAD(P)H, in 0.1 M Na 2 HP0 4 buffo-, pH 7.4 ( reduction reaction); in a total 
volume of 0. 1 mL Changes in absorbance at 340 nm (A^) are measured at 23.5 ° C using a recording 

20 spectrophotometer (Shimadzu Scientific Instruments, Inc., Pleasanton CA). The amount of NAD(P)H 
is stoichiometrically equivalent to the amount of substrate initially present, and the change in A^ is a 
direct measure of the amount of NAD(P)H produced; AA^o = 6620[NADH]. Oxidoreductase activity 
of DITHP activity is proportional to the amount of NAD(P)H presort in the assay. 

Transferase activity of DITHP is measured through assays such as a methyl transferase assay 

25 in which the transfer of radiolabeled methyl groups between a donor substrate and an acceptor 

substrate is measured (Bokar, J.A et al. (1994) J. Biol. Chem. 269:17697-17704). Reaction mixtures 
(50 pi final volume) contain 15 mM HEPES, pH 7.9, 1.5 mM MgCl 2 , 10 mM dithiothreitol, 3% 
polyvinylalcohol, 1.5 \xCi [metfry7- 3 H]AdoMet (0.375 |jM AdoMet) (DuPont-NEN), 0.6 ug DITHP, 
and acceptor substrate (0.4 |ig [ 35 S]RNA or 6-mercaptopurine (6-MP) to 1 mM final concentration). 

3 o Reaction mixtures are incubated at 30°C for 30 minutes, then 65 °C for 5 minutes. The products are 
separated by chromatography or electrophoresis and the level of methyl transferase activity is 
determined by quantification of methyl- 3 K recovery. 

DITHP hydrolase activity is measured by the hydrolysis of appropriate synthetic peptide 
substrates conjugated with various chromogenic molecules in which the degree of hydrolysis is 
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quantified by spectrophotometric (or fluorometric) absoiption of the released chromophore. (Beynon, 
RJ. and J.S. Bond (1994) Proteolytic Enzymes: A Practical Approach . Oxford University Press, New 
York NY, pp. 25-55) Peptide substrates are designed according to the category of protease activity as 
endopeptidase (serine, cysteine, aspartic proteases), animopeptidase (leucine aminopeptidase), or 
5 carboxypeptidase (Carboxypeptidase A and B, procollagen C-proteinase). 

DITHP isomerase activity such as peptidyl prolyl cis/trans isomerase activity can be assayed 
i by an enzyme assay described by Rahfeld, J.U., et al. (1994) (FEBS Lett 352: 180-184). The assay 

is performed at 10°C in 35 mM HEPES buffer, pH 7.8, containing chymotrypsin (0.5 mg/ml) and 
DITHP at a variety of concentrations. Under these assay conditions, the substrate, Suc-Ala-Xaa-Pro- 
l o Phe-4-N A, is in equilibrium with respect to the prolyl bond, with 80-95% in trans and 5-20% in cis 
confoimatioa An aliquot (2 ul) of the substrate dissolved in dimethyl sulfoxide (10 mg/ml) is added 
to the reaction mixture described above. Only the cis isomer of the substrate is a substrate for 
cleavage by chymotrypsin. Thus, as the substrate is isomerized by DITHP, the product is cleaved by 
chymotrypsin to produce 4-nitroanilide, which is detected by it's absorbance at 390 nm. 4- 
! is Nitroanilide appears in a time-dependent and a DITHP concentration-dependent manner. 

An assay for DITHP activity associated with growth and development measures cell 
proliferation as the amount of newly initiated DNA synthesis in Swiss mouse 3T3 ceils. A plasmid 
containing polynucleotides encoding DITHP is transfected into quiescent 3T3 cultured cells using 
methods well known in the art The transiently transfected cells are then incubated in the presence of 

2 o pH]thymidine, a radioactive DNA precursor. Where applicable, varying amounts of DITHP ligand are 

added to the transfected cells. Incorporation of [ 3 H]thymidine into acid-precipitable DNA is measured 
over an appropriate time interval, and the amount incorporated is directly proportional to the amount of 
newly synthesized DNA. 

Growth factor activity of DITHP is measured by the stimulation of DNA synthesis in Swiss 
25 mouse 3T3 cells (McKay, I. and I. Leigh, eds. (1993) Growth Factors: A Practical Approach , Oxford 
University Press, New York NY). Initiation of DNA synthesis indicates the cells' entry into the mitotic 
cycle and their commitment to undergo later division. 3T3 cells are competent to respond to most 
growth factors, not only those that are mitogenic, but also those that are involved in embryonic 
inductioa This competence is possible because the in vivo specificity demonstrated by some growth 

3 o factors is not necessarily inherent but is determined by the responding tissue. In this assay, varying 

amounts of DITHP are added to quiescent 3T3 cultured cells in the presence of [^thymidine, a 
radioactive DNA precursor. DITHP for this assay can be obtained by recombinant means or from 
biochemical preparations. Incorporation of [^thymidine into acid-precipitable DNA is measured over 
an appropriate time interval, and the amount incorporated is directly proportional to the amount of 
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. newly synthesized DNA A linear dose-response curve over, at least a hundred-fold DITHP 
concentration range is indicative of growth factor activity. One unit of activity per milliliter is defined 
as the concentration of DITHP producing a 50% response level, where 100% represents maximal 
incorporation of [ 3 H]thymidine into acid-precipitable DNA 
5 Alternatively, an assay for cytokine activity of DITHP measures the proliferation of 

leukocytes. In this assay, the amount of tritiated thymidine incorporated into newly synthesized DNA 
is used to estimate proliferative activity. Varying amounts of DITHP are added to cultured 
leukocytes, such as granulocytes, monocytes, or lymphocytes, in the presence of [ 3 H]thymidine, a 
radioactive DNA precursor. DITHP for this assay can he obtained by recombinant means or from 

1 o biochemical preparations. Incorporation of [ 3 H]thymidine into acid-precipitable DNA is measured 

over an appropriate time interval, and the amount incorporated is directly proportional to the amount 
of newly synthesized DNA. A linear dose-response curve ova: at least a hundred-fold DITHP 
concentration range is indicative of DITHP activity. One unit of activity per milliliter is 
conventionally defined as the concentration of DITHP producing a 50% response level, where 100% 

15 represents maximal incorporation of [ 3 H]thymidine into acid-precipitable DNA. 

An alternative assay for DITHP cytokine activity utilizes a Boyden micro chamber 
(Neuroprobe, Cabin John MD) to measure leukocyte chemotaxis (Vicari, supraV In this assay, about 
10 5 migratory cells such as macrophages or monocytes are placed in cell culture media in the upper 
compartment of the chamber. Varying dilutions of DITHP are placed in the lower compartment The 

20 two compartments are separated by a 5 or 8 micron pore polycarbonate filter (Nucleopore, Pleasanton 
CA). After incubation at 37 °C for 80 to 120 minutes, the filters are fixed in methanol and stained 
with appropriate labeling agents. Cells which migrate to the other side of the filter are counted using 
standard microscopy. The chemotactic index is calculated by dividing the number of migratory cells 
counted when DITHP is present in the lower compartment by the number of migratory cells counted 

2 5 when only media is present in the lower compartment The chemotactic index is proportional to the 

activity of DITHP. 

Alternatively, cell lines or tissues transformed with a vector containing dithp can be assayed for 
DITHP activity by immunoblotting. Cells are denatured in SDS in the presence of (J-mercaptoethanol, 
nucleic acids removed by ethanol precipitation, and proteins purified by acetone precipitation. Pellets 
30 are resuspended in 20 mM tris buffer at pH 7.5 and incubated with Protein G-Sepharose pre-coated 

with an antibody specific for DITHP. Alto: washing, the Sepharose beads are boiled in electrophoresis 
| sample buffer, and the eluted proteins subjected to SDS-PAGE. The SDS-PAGE is transferred to a 
nitrocellulose membrane for immunoblotting, and the DITHP activity is assessed by visualizing and 
quantifying bands on the blot using the antibody specific for DITHP as the primary antibody and 125 I- 
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labeled IgG specific for the primary antibody as the secondary antibody. 

DITHP kinase activity is measured by phosphorylation of a protein substrate using y-labeled 
[ 32 P]-ATP and quantitation of the incorporated radioactivity using a radioisotope counter. DITHP is 
incubated with the protein substrate, [ 32 P]-ATP, and an appropriate kinase buffer. The [ 32 P] 
5 incorporated into the product is separated from free [ 32 P]-ATP by electrophoresis and the incorporated 
[^P] is counted. The amount of [ 32 P] recovered is proportional to the kinase activity of DITHP in 
the assay. A determination of the specific amino acid residue phosphorylated is made by 
phosphoamino acid analysis of the hydrolyzed protein. 

In the alternative, DITHP activity is measured by the increase in cell prolif eration resulting 

10 from transformation of a mammalian cell line such as COS7i HeLa or CHO with an eukaryotic 

i i 
expression vector encoding DITHP. Eukaryotic expression vectors are commercially available, and 

the techniques to introduce them into cells are well known to those skilled in the art The cells are 

incubated for 48-72 hours after transformation under conditions appropriate for the cell line to allow 

expression of DITHP. Phase microscopy is then used to compare the mitotic index of transformed 

is versus control cells. An increase in the mitotic index indicates DITHP activity. 

In a further alternative, an assay for DITHP signaling activity is based upon the ability of 
GPCR family proteins to modulate G protein-activated second messenger signal transduction 
pathways (e.g., cAMP; Gaudin, P. et al. (1998) J. Biol. Chem, 273:4990-4996). Aplasmid encoding 
full length DITHP is transfected into a mammalian cell line (e.g., Chinese hamster ovary (CHO) or 

2 o human embryonic kidney (HEK-293) cell lines) using methods well-known in the art. Transfected 
cells are grown in 1 2-weH trays in culture medium for 48 hours, then the culture medium is discarded, 
and the attached cells are gently washed with PBS. The cells are then incubated in culture medium 
with or without ligand for 30 minutes, then the medium is removed and cells lysed by treatment with 
1 M perchloric acid. The cAMP levels in the lysate are measured by radioimmunoassay using 

2 5 methods well-known in the art. Changes in the levels of c AMP in the lysate from cells exposed to 
ligand compared to those without ligand are proportional to the amount of DITHP present in the 
transfected cells. 

Alternatively, an assay for DITHP protein phosphatase activity measures the hydrolysis of P- 
nitrophenyl phosphate (PNPP). DITHP is incubated together with PNPP in HEPES buffer pH 7.5, in 
30 the presence of 0. 1 % P-mercaptoethanol at 37 °C for 60 min. The reaction is stopped by the addition of 
6 ml of 10 N NaOH, and the increase in light absorbance of the reaction mixture at 410 nm resulting 
from the hydrolysis of PNPP is measured using a spectrophotometer. The increase in light absorbance 
is proportional to the phosphatase activity of DITHP in the assay (Diamond, R.H. et al (1994) Mol Ce31 
Biol 14:3752-3762). 
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An alternative assay measures DITHP-mediated G-protein signaling activity by monitoring 
the mobilization of Ca"* as an indicator of the signal transduction pathway stimulation. (See, e.g., 
Grynkievicz, G. et al. (1985) J. Biol. Chem. 260:3440; McCoIl, S. et al. (1993) J. Immunol. 
150:4550-4555; arid Aussel, C. et al. (1988) J. Immunol. 140:215-220). Hie assay requires 
5 preloading neutrophils or T cells with a fluorescent dye such as FURA-2 or BCECF (Universal 
Imaging Corp, Westchester PA) whose emission characteristics are altered by Ca** binding. When 
the cells are exposed to one or more activating, stimuli artificially (e.g., anti-CD3 antibody ligation of 
the T cell receptor) or physiologically (e.g., by allogeneic stimulation), Ca** flux takes place. This 
flux can be observed and quantified by assaying the cells in a fluorometer or fluorescent activated cell 

10 sorter. Measurements of Ca** flux are compared between cells in their normal state and those 

transfected with DITHP. Increased Ca** mobilization attributable to increased DITHP concentration 
is proportional to DITHP activity. 

DITHP transport activity is assayed by measuring uptake of labeled substrates into Xenopus 
laevis oocytes. Oocytes at stages V and VI are injected with DITHP mRNA (10 ng per oocyte) and 

15 incubated for 3 days at 18°C in OR2 medium (82.5mM NaCl, 2.5 mM KC1, ImM CaCl 2 , ImM MgCl^ 
ImM Na^HPO^ 5 mM Hepes, 3.8 mM NaOH, 50|ug/ml gentamycin, pH 7.8) to allow expression of 
DITHP protein. Oocytes are then transferred to standard uptake medium (lOOmM NaCl, 2 mM KCI, 
ImM CaCl 2 , ImM MgCIs, 10 mM Hepes/Tris pH 7.5). Uptake of various substrates (e.g., amino 
acids, sugars, drugs, ions, and neurotransmitters) is initiated by adding labeled substrate (e*g. 

20 radiolabeled with 3 H, fluorescently labeled with rhodamine, etc.) to the oocytes. After incubating for 30 
minutes, uptake is terminated by washing the oocytes three times in Na + -free medium, measuring the 
incorporated label, and comparing with controls. DITHP transport activity is proportional to the level 
of internalized labeled substrate. 

DITHP transferase activity is demonstrated by a test for galactosyltransferase activity. This 

2 5 can be determined by measuring the transfer of radiolabeled galactose from UDP-galactose to a 

GlcNAc-terminated oligosaccharide chain (Kolbinger, F. et al. (1998) J. Biol. Chan. 273:58-65). The 
sample is incubated with 14 pi of assay stock solution (180 mM sodium cacodylate, pH 6.5, 1 rng/nfl 
bovine serum albumin, 0.26 mM UDP-galactose, 2 |il of UDP-[ 3 H]galactose), 1 |il of MnCl* (500 
mM), and 2.5 |il of GlcNAcpO^CH^-CO^Me (37 mg/ml in dimethyl sulfoxide) for 60 minutes at 

30 37 °C. The reaction is quenched by the addition of 1 ml of water and loaded on a CI 8 Sep-Pak 
cartridge (Waters), and the column is washed twice with 5 ml of water to remove unreacted UDP- 
pH]galactose. The pH]galactosylated GlcNAcpO-(CH 2 ) 8 -C0 2 Me remains bound to the column during 
the water washes and is eluted with 5 ml of methanol. Radioactivity in the eluted material is measured 
by liquid scintillation counting and is proportional to galactosyltransferase activity in the starting 
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sample. 

In the alternative, DITHP induction by beat or toxins may be demonstrated using primary 
cultures of human fibroblasts or human cell lines such as CCL-13, HEK293, or HEP G2 (ATCC). To 
heat induce DITHP expression, aliquots of cells are incubated at 42 °C for 15, 30, or 60 minutes. 
5 Control aliquots are incubated at 37 °C for the same time periods. To induce DITHP expression by 
toxins, aliquots of cells are treated with 100 \M arsenite or 20 mM azetidine-2-carboxylic acid for 0, 
3, 6, or 12 hours. After exposure to heat, arsenite, or the amino acid analogue, samples of the treated 
ceils are harvested and cell lysates prepared for analysis by western blot. Cells are lysed in lysis 
buffer containing 1% Nonidet P-40, 0.15 M NaCl, 50 mM Tris-HCl, 5 mM EDTA, 2 mM, ' 

10 N-ethylmaleimide, 2 mM phenylmethylsulfonyl fluoride, 1 mg/ml leupeptin, and 1 mg/ml pepstatin. 
Twenty micrograms of the cell lysate is separated on an 8% SDS-PAGE gel and transferred to a 
membrane. After blocking with 5% nonfat dry milk/phosphate-buffered saline for 1 h, the membrane 
is incubated overnight at 4°C or at room temperature for 2-4 hours with a 1 : 1000 dilution of 
anti-DITHP serum in 2% nonfat dry milk/phosphate-buffered saline. The membrane is then washed 

1 5 and incubated with a 1 : 1000 dilution of horseradish peroxidase-conjugated goat anti-rabbit IgG in 2% 
• dry milk/phosphate-buffered saline. After washing with 0.1% Tween 20 in phosphate-buffered saline,- 
the DITHP protein is detected and compared to controls using chemiluminescence. 

Alternatively, DITHP protease activity is measured by the hydrolysis of appropriate synthetic 
peptide substrates conjugated with various chromogenic molecules in which the degree of hydrolysis 

20 is quantified by spectrophotometry (or fluorometric) absorption of the released cbromophore 

(Beynon, RJ. and J.S. Bond (1994) Proteolytic Enzymes: A Practical Approach, Oxford University 
Press, New York, NY, pp.25-55). Peptide substrates are designed according to the category of 
protease activity as endopeptidase (serine, cysteine, aspartic proteases, or metalloproteases), 
aminopeptidase Oeucine aminopeptidase), or carboxypeptidase (carboxypeptidases A and B, 

25 procollagen C-proteinase). Commonly used chromogens are 2-naphthylamine, 4-nitroaniline, and 
furylacrylic acid. Assays are performed at ambient temperature and contain an aliquot of the enzyme 
and the appropriate substrate in a suitable buffer. Reactions are carried out in an optical cuvette, and 
the increase/decrease in absorbance of the chromogen released during hydrolysis of the peptide 
substrate is measured. Hie change in absorbance is proportional to the DITHP protease activity in the 

30 assay. 

In the alternative, an assay for DITHP protease activity takes advantage of fluorescence 
resonance energy transfer (FRET) that occurs when one donor and one acceptor fluorophore with an 
appropriate spectral overlap are in close proximity. A flexible peptide linker containing a cleavage 
site specific for PRTS is fused between a red-shifted variant (RSGFP4) and a blue variant (BFP5) of 
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Green Fluorescent Protein. This fusion protein has spectral properties that suggest energy transfer is 
occurring from BFP5 to RSGFP4. When the fusion protein is incubated with DITHP, the substrate is 
cleaved, and the two fluorescent proteins dissociate. This is accompanied by a marked decrease in 
energy transfer which is quantified by comparing the emission spectra before and after the addition of 
5 DITHP (Mitra, R.D. et al (1996) Gene 173:13-17). This assay can also be performed in living cells. 
In this case the fluorescent substrate protein is expressed cohstitutively in cells and DITHP is 
introduced on an inducible vector so that FRET can be monitored in the presence and absence of 
DITHP (Sagot, I. et al (1999) FEBS Lett. 447:53-57). 

A method to determine the nucleic acid binding activity of DITHP involves a poly acrylamide 

l o gel mobility-shift assay. In preparation for this assay, DITHP is expressed by transforming a 

mammalian cell line such as COS7, HeLa or CHO with a eukaryotic expression vector containing 
DITHP cDNA. The dells are incubated for 48-72 hours after transformation under conditions 
appropriate for the cell line to allow expression and accumulation of DITHP. Extracts containing 
solubilized proteins can be prepared from cells expressing DITHP by methods well known in the art 

is Portions of the extract containing DITHP are added to [ 32 P]-labeled RNA or DNA. Radioactive 
nucleic acid can be synthesized in vitro by techniques well known in the art The mixtures are • 
incubated at 25 °C in the presence of RNase- and DNase-inhibitors under buffered conditions for 5-10 
minutes. After incubation, the samples are analyzed by polyacrylamide gel electrophoresis followed 
by autoradiography. The presence of a band on the autoradiogram indicates the formation of a 

20 complex between DITHP and the radioactive transcript. A band of similar mobility will not be 
present in samples prepared using control extracts prepared from untransformed cells. 

In the alternative, a method to determine the methylase activity of a DITHP measures transfer 
of radiolabeled methyl groups between a donor substrate and an acceptor substrate. Reaction 
mixtures (50 pi final volume) contain 15 mM HEPES, pH 7.9, 1.5 mM MgCl 2 , 10 mM dithiothreitol, 

25 3% polyvinylalcohol, 1.5 |iCi [mcr/iyZ- 3 H]AdoMet (0.375 jaM AdoMet) (DuPont-NEN), 0.6 \ig 
DITHP, and acceptor substrate (e.g., 0.4 jig [ 35 S]RNA, or 6-mercaptopurine (6-MP) to 1 mM final 
concentration). Reaction mixtures are incubated at 30°C for 30 minutes, then 65 °C for 5 minutes. 
Analysis of [methyl- 3 H]KNA is as follows: 1) 50 pi of 2 x loading buffer (20 mM Tris-HCl, pH 7.6, 1 
M LiCl, 1 mM EDTA, 1% sodium dodecyl sulphate (SDS)) and 50 \j1 oligo d(T)-cellulose (10 mg/ml 

30 in 1 x loading buffer) are added to the reaction mixture, and incubated at ambient temperature with 
shaking for 30 minutes. 2) Reaction mixtures are transferred to a 96- well filtration plate attached to a 
vacuum apparatus. 3) Each sample is washed sequentially with three 2.4 ml aliquots of 1 x oligo d(T) 
loading buffer containing 0.5% SDS, 0.1% SDS, or no SDS. and 4) RNA is eluted with 300 pi of 
water into a 96-well collection plate, transferred to scintillation vials containing liquid scintillant, and 
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radioactivity determined. Analysis of [metfiyZ- 3 H]6-MP is as follows: 1) 500 \A 0.5 M borate buffer, 
pH 10.0, and then 2.5 ml of 20% (v/v) isoamyl alcohol in toluene are added to the reaction mixtures. 
2) The samples mixed by vigorous vortexing for ten seconds. 3) After centrifugation at 700g for 10 
minutes, 1.5 ml of the organic phase is transferred to scintillation vials containing 0.5 ml absolute 
5 ethanol and liquid scintillant, and radioactivity determined, and 4) Results are corrected for the 
extraction of 6-MP into the organic phase (approximately 41 %). 

An assay for adhesion activity of DITHP measures the disruption of cytoskeletal filament 
networks upon overexpression of DITHP in cultured cell lines (Rezniczek, G.A. et al. (1998) J. Ceil 
Biol. 141 :209-225). cDNA encoding DITHP is subcloned into a mammalian expression vector that 

10 drives high levels of cDNA expression. This construct is transfected into cultured cells, such as rat 
kangaroo PtK2 or rat bladder carcinoma 804G cells. Actin filaments and intermediate filaments such 
as keratin and vimentin are visualized by immunofluorescence microscopy using antibodies and 
techniques well known in the art. The configuration and abundance of cytoskeletal filaments can be 
assessed and quantified using confocal imaging techniques. In particular, the bundling and collapse 

( 15 of cytoskeletal filament networks is indicative of DITHP adhesion activity. 

Alternatively, an assay for DITHP activity measures the expression of DITHP on the cell 
surface. cDNA encoding DITHP is transfected into a non-leukocytic cell line. Cell surf ace proteins 
are labeled with biotin(de la Fuente, M. A. et al. (1997) Blood 90:2398-2405). Immunoprecipitations 
are performed using DITHP-specific antibodies, and immunoprecipitated samples are analyzed using 

2 o SDS-PAGE and immunoblotting techniques. The ratio of labeled immunoprecipitant to unlabeled 
immunoprecipitant is proportional to the amount of DITHP expressed on the cell surface. 

Alternatively, an assay for DITHP activity measures the amount of cell aggregation induced 
by overexpression of DITHP. In this assay, cultured cells such as NIH3T3 are transfected with cDNA 
encoding DITHP contained within a suitable mammalian expression vector under control of a strong 

2 5 promoter. Cotransfection with cDNA encoding a fluorescent marker protein, such as Green 

Fluorescent Protein (CLONTECH), is useful for identifying stable transfectants. The amount of cell 
agglutination, or clumping, associated with transfected cells is compared with that associated with 
untransfected cells. The amount of cell agglutination is a direct measure of DITHP activity. 

DITHP may recognize and precipitate antigen from serum. This activity can be measured by 

30 the quantitative precipitin reaction (Golub, E.S. et al. (1987) Immunology: A Synthesis* Sinauer 

Associates, Sunderland MA, pages 113-115). DITHP is isotopically labeled using methods known in 
the art Various serum concentrations are added to constant amounts of labeled DITHP. DITHP- 
antigen complexes precipitate out of solution and are collected by centrifugation. The amount of 
precipitable DITHP-antigen complex is proportional to the amount of radioisotope detected in the 
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precipitate. The amount of precipitable DITHP-antigen complex is plotted against the serum 
concentration. For various serum concentrations, a characteristic precipitation curve is obtained, in 
which the amount of precipitable DITHP-antigen complex initially increases proportionately with 
increasing serum concentration, peaks at the equivalence point, and then decreases proportibnately with 
5 further increases in serum concentratioa Thus, the amount of precipitable DITHP-antigen complex is 
a measure of DITHP activity which is characterized by sensitivity to both limiting and excess quantities 
ofantigea 

A microtubule motility assay for DITHP measures motor protein activity. In this assay, 
recombinant DITHP is immobilized onto a glass slide or similar substrate. Taxol-stabilized bovine 

10 brain microtubules (commercially available) in a solution containing ATP and cytosolic extract are 
perfused onto the slide. Movement of microtubules as driven by DITHP motor activity can be 
visualized and quantified using video-enhanced light microscopy and image analysis techniques. 
DITHP motor protein activity is directly proportional to the frequency and velocity of microtubule 

, movement. 

15 Alternatively, an assay for DITHP measures the formation of protein filaments in vitro . A 

solution of DITHP at a concentration greater than the "critical concentration" for polymer assembly is 
applied to carbon-coated grids. Appropriate nucleation sites may be supplied in the solution. The 
grids are negative stained with 0.7% (w/v) aqueous uranyl acetate and examined by electron 
microscopy. The appearance of filaments of approximately 25 nm (microtubules), 8 nm (actin), or 10 

2 o nm (intermediate filaments) is a demonstration of protein activity. 

DITHP electron transfer activity is demonstrated by oxidation or reduction of NADP. 
Substrates such as Asn-pGal, biocytidine, or ubiquinone- 10 may be used. The reaction mixture 
contains 1-2 mg/ml HORP, 15 mM substrate, and 2.4 mM NAD(P) + in 0.1 M phosphate buffer, pH 
7.1 (oxidation reaction), or 2.0 mM NAD(P)H, in 0.1 M Na 2 HP0 4 buffer, pH 7.4 (reduction reaction); 

25 in a total volume of 0. 1 ml. FAD may be included with NAD, according to methods well known in the 
art Changes in absorbance are measured using a recording spectrophotometer. The amount of 
NAD(P)H is stoichiometrically equivalent to the amount of substrate initially present, and the change in 
A^ is a direct measure of the amount of NAD(P)H produced; AA^ = 6620[NADH]. DITHP activity 
is proportional to the amount of NAD(P)H present in the assay. The increase in extinction coefficient 

30 of NAD(P)H coenzyme at 340 nm is a measure of oxidation activity, or the decrease in extinction 

coefficient of NAD(P)H coenzyme at 340 nm is a measure of reduction activity (Dalziel, K. {1963) J. 
Biol. Chem. 238:2850-2858). 

DITHP transcription factor activity is measured by its ability to stimulate transcription of a 
reporter gene (Liu, H.Y. et al. (1997) EMBO J. 16:5289-5298). The assay entails the use of a well 
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characterized reporter gene construct, LexA^-LacZ, that consists of LexA DNA transcriptional control 
dements (LexA^ fused to sequences encoding the E. coli LacZ enzyme. Hie methods for constructing 
and expressing fusion genes, introducing than into cells, and measuring LacZ enzyme activity, are well 
known to those skilled in the art Sequences encoding DITHP are cloned into a plasmid that directs the 
5 synthesis of a fusion protein, LexA-DITHP, consisting of DITHP and a DNA binding domain derived 
from the LexA transcription factor. Hie resulting plasmid, encoding a LexA-DITHP fusion protein, is 
introduced into yeast cells along with a plasmid containing the LexA^-LacZ reporter gene. The amount 
of LacZ enzyme activity associated with LexA-DITHP transfected cells, relative to control cells, is 
proportional to the amount of transcription stimulated by the DITHP. 

10 Chromatin activity of DITHP is demonstrated by measuring sensitivity to DNase I (Dawson, 

B.A et al. (1989) J. Biol. Chem. 264:12830-12837). Samples are treated with DNase I, followed by 
insertion of a cleavable biotinylated nucleotide analog, 5-[(N-biotinamido)hexanoamido-ethyH,3- 
tMopropionyl-3-aminoallyl]-2'-deoxyuridine 5 '-triphosphate, using nick-repair techniques well known 
to those skilled in the art Following purification and digestion with EcoRI restriction endonuclease, 

1 5 biotinylated sequences are affinity isolated by sequential binding to streptavidin and biotincellulose. 
Another specific assay demonstrates the ion conductance capacity of DITHP using an 
electrophysiological assay. DITHP is expressed by transforming a mammalian cell line such as 
COS7, HeLa or CHO with a eukaryotic expression vector encoding DITHP. Eukaryotic expression 
vectors are commercially available, and the techniques to introduce them into cells are well known to 

2 o those skilled in the art. A small amount of a second plasmid, which expresses any one of a number of 
marker genes such as P-galactosidase, is co-transformed into the cells in order to allow rapid 
identification of those cells which have taken up and expressed the foreign DNA The cells are 
incubated for 48-72 hours after transformation under conditions appropriate for the cell line to allow 
expression and accumulation of DITHP and p-galactosidase. Transformed cells expressing (5- 

2 5 galactosidase are stained blue when a suitable colorimetric substrate is added to the culture media 

undo: conditions that are well known in the art Stained cells are tested for differences in membrane 
conductance due to various ions by electrophysiological techniques that are well known in the art. 
Untransf ormed cells, and/or cells transformed with either vector sequences alone or P-galactosidase 
sequences alone, are used as controls and tested in parallel. The contribution of DITHP to cation or 

3 o anion conductance can be shown by incubating the cells using antibodies specific for either DITHP. 

The respective antibodies will bind to the extracellular side of DITHP, thereby blocking the pore in 
the ion channel, and the associated conductance, 

XV. Functional Assays 
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DITHP function is assessed by expressing dithp at physiologically elevated levels in 
mammalian cell culture systems. cDNA is subcloned into a mammalian expression vector containing a 
strong promoter that drives high levels of cDNA expression. Vectors of choice include pCMV SPORT 
(Life Technologies), and pCR3.1 (Invitrogen Corporation, Carlsbad CA), both of which contain the 
5 cytomegalovirus promote". 5-10 fig of recombinant vector are transiently transfected into a human cell 
line, preferably of endothelial or hematopoietic origin, using either liposome formulations or 
electroporation. 1-2 p.g of an additional plasmid containing sequences encoding a marker protein are 
co-transfected. 

Expression of a marker protein provides a means to distinguish transfected cells from 
10 nontransfected cells and is a reliable predictor of cDNA expression from the recombinant vector. 

Marker proteins of choice include, e.g., Green Fluorescent Protein (GFP; CLONTECH), CD64, or a 
CD64-GFP fusion protein. Flow cytometry (FCM), an automated laser optics-based technique, is used 
to identify transfected cells expressing GFP or CD64-GFP and to evaluate the apoptotic state of the 
cells and other cellular properties. 
15 FCM detects and quantifies the uptake of fluorescent molecules that diagnose events preceding 

or coincident with cell death. These events include changes in nuclear DNA content as measured by 
staining of DNA with propidium iodide; changes in cell size and granularity as measured by forward 
light scatter and 90 degree side light scatter; down-regulation of DNA synthesis as measured by 
decrease in bromodeoxyuridine uptake; alterations in expression of cell surface and intracellular 
20 proteins as measured by reactivity with specific antibodies; and alterations in plasma membrane * 
composition as measured by the binding of fluorescein-conjugated Annexin V protein to the cell 
surface. Methods in flow cytometry are discussed in Ormerod, M. G. (1994) Flow Cytometry . Oxford, 
New York NY. 

The influence of DITHP on gene expression can be assessed using highly purified populations 
25 of cells transfected with sequences encoding DITHP and either CD64 or CD64-GFP. CD64 and 
CD64-GFP are expressed on the surface of transfected cells and bind to conserved regions of human 
immunoglobulin G (IgG). Transfected cells are efficiently separated from nontransfected cells using 
magnetic beads coated with either human IgG or antibody against CD64 (DYNAL, Inc., Lake SucGess 
NY). mRNA can be purified from the cells using methods well known by those of skill in the art 
3 o Expression of mRNA encoding DITHP and other genes of interest can be analyzed by northern analysis 
or microarray techniques. 



XVI. Production of Antibodies 

DITHP substantially purified using polyacrylamide gel electrophoresis (PAGE; see, e.g., 
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Harrington, M.G. (1990) Methods Enzymol. 182:488-495), or other purification techniques, is used to 

immunize rabbits and to produce antibodies using standard protocols. 

Alternatively, the DITHP amino acid sequence is analyzed using LASERGENE software 

(DNASTAR) to determine regions of high immunogenicity, and a corresponding peptide is synthesized 
5 and used to raise antibodies by means known to those of skill in the art Methods for selection of 

appropriate epitopes, such as those near the C-terminus or in hydrophilic regions are well described iji 

the art (See, e.g., Ausubel, 1995, supra. Chapter 11.) 

Typically, peptides 15 residues in length are synthesized using an ABI 431A peptide 

synthesizer (Applied Biosystems) using fmbc-chemistry and coupled to KLH (Sigma) by reaction with 
10 N-maleimidobenzoyl-N-hydroxysuccinimide ester (MBS) to increase immunogenicity. (See, e.g., 

Ausubel, supra .) Rabbits are immunized with the peptide-KLH complex: in complete Freund's 

adjuvant Resulting antisera are tested for antipeptide activity by, for example, binding the peptide to 

plastic, blocking with 1 % BSA, reacting with rabbit antisera, washing, and reacting with radio- 

iodinated goat anti-rabbit IgG. Antisera with antipeptide activity are tested for anti-DITHP activity 
1 5 using protocols well known in the art, including ELIS A, RIA, and immunoblotting. 

XVIL Purification of Naturally Occurring DITHP Using Specific Antibodies 

Naturally occurring or recombinant DITHP is substantially purified by immunoaffinity 

chromatography using antibodies specific for DITHP. An immunoaffinity column is constructed by 
20 covalently coupling anti-DITHP antibody to an activated chromatographic resin, such as 

CNBr-activated SEPHAROSE (Amersham Pharmacia Biotech). After the coupling, the resin is 

blocked and washed according to the manufacturer's instructions. 

Media containing DITHP are passed over the immunoaffinity column, and the column is 

washed under conditions that allow the preferential absorbance of DITHP (e.g., high ionic strength 

2 5 buffers in the presence of detergent). The column is eluted under conditions that disrupt 

antibody/DITHP binding (e.g„ a buffo* of pH 2 to pH 3, or a high concentration of a chaotrope, such 
as urea or thiocyanate ion), and DITHP is collected. 

XVIII. Identification of Molecules Which Interact with DITHP 

3 o DITHP, or biologically active fragments thereof, are labeled with 125 I Bolton-Hunter reagent 

(See, e.g., Bolton, A.E. and W.M. Hunter (1973) Biochem. J. 133:529-539.) Candidate molecules 
previously arrayed in the wells of a multi-well plate are incubated with the labeled DITHP, washed, and 
any wells with labeled DITHP complex are assayed. Data obtained using different concentrations of 
DITHP are used to calculate values for the number, affinity, and association of DITHP with the 
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candidate molecules. 

Alternatively, molecules interacting' with DITHP are analyzed using the yeast two-hybrid 
system as described in Fields, S. and O. Song (1989) Nature 340:245-246, or using commercially 
available kits based on the two-hybrid system, such as the MATCHMAKER system (CLONTECH). 
5 DITHP may also be used in the PATHCALLING process (CuraGen Corp., New Haven CT) 

which employs the yeast two-hybrid system in a high-throughput maimer to determine all interactions 
between the proteins encoded by two large libraries of genes (Nandabalan, K. et al. (2000) U.S. Patent 
No. 6,057,101). 

10 All publications and patents mentioned in the above specification are herein incorporated by 

reference. Various modifications and variations of the described method and system of the invention 
will be apparent to those skilled in the art without departing from the scope and spirit of the invention. 
Although the invention has been described in connection with specific preferred embodiments, it should 
be understood that the invention as claimed should not be unduly limited to such specific embodiments. 

15 Indeed, various modifications of the above-described modes for carrying out the invention which are 
obvious to those skilled in the field of molecular biology or related fields are intended to be within the 
scope of the following claims. 
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"ABLE 3 




Template ID 


SEQ ID NO: 




1 


LG:1040582.1:2000FEB18 


1 


LG:1040582.1 :2000FEB18 


1 


LG:1040582.1 :2000FEB18 


2 


LG:453570.1:2000FEB18 


3 


LG:408751 .3:2000FEB1 8 


3 


LG:408751 .3:2000FEB1 8 


3 


LG:408751.3:2000FEB18 


3 


LG:408751 .3:2000FEB18 


3 


LG:408751 .3:2000FEB18 


3 


LG:408751 .3:2000FEB18 


3 


LG:408751 .3:2000FEB18 


3 


LG:408751 .3:2000FEB18 


4 


Ll:090574.1 :2000FEB01 


4 


Ll:090574.1 :2000FEB01 


4 


Ll:090574.1 :2000FEB01 


4 


Ll:090574.1 :2000FEB01 


4 


Ll:090574.1 :2000FEB01 


4 


Ll:090574.1 :2000FEB01 


5 


LI:229932.2:20D0FEB01 


5 


LI:229932.2:2000FEB01 


5 


LI:229932.2:2000FEB01 


, 5 


LI:229932.2:2000FEB01 


5 


LI:229932.2:2000FEB01 


5 


LI:229932.2:2000FEB01 


5 


LI:229932.2:2000FEB01 


5 


LI:229932.2:2000FEB01 


5 


L1:229932.2:2000FEB01 


5 


LI:229932.2:2000FEB01 


5 


LI:229932.2:2000FEB01 


5 


LI:229932.2:2000FEB01 


6 


Ll:3321 76.1 :2000FEB01 


6 


Ll:332176.1 :2000FEB01 


6 


Ll:3321 76.1 :2000FEB01 


6 


Ll:3321 76.1 :2000FEB01 


6 


Ll:332176.1 :2000FEB01 


6 


LI:332176.1:2000FEB01 


6 


Ll:332176.1:2000FEB01 


6 


LI:332176.1:2000FEB01 


6 


LI:332176.1:2000FEB01 


6 


LI:332176.1:2000FEB01 


6 


LI:332176.1:2000FEB01 


6 


LI:332176.1:2000FEB01 


6 


LI:332176.1:2000FEB01 


6 


LI:332176.1:2000FEB01 


6 


LI:332176.1:2000FEB01 


6 


LI:332176.1:2000FEB01 


6 


LI:332176.1:2000FEB01 


6 


LI:332176.1:2000FEB01 


6 


LI:332176.1:2000FEB01 


8 


LG:220992.1 :2000MAY19 


8 


LG:220992.1 :2000MAY1 9 


8 


LG:220992.1 :2000MAY19 


8 


LG:220992.1 :2000MAY19 



I 

I 



PCTAJS01/06059 



Start 


Stop 


Frame 


Domain 


Topology 








' Tvd© 




31 


117 


forward 1 


TM 


Nin 


319 


405 


forward 1 


TM 


Nin 


108 


155 


forward 3 


TM 


N out 


361 


447 


forward 1 


TM 


Nin 


1318 


1404 


forward 1 


TM 


Nin 


1025 


1099 


forward 2 


TM 


Nin 


1298 


1360 


forward 2 


TM 


Nin- 


1379 


1441 


forward 2 


TM 


Nin 


1463 


1537 


forward 2 


TM 


Nin 


1047 


1133 


forward 3 


TM 


N in 


1266 


1352 


forward 3 


TM 


Nin 


1419 


1469 


forward 3 


TM 


N in 


79 


144 


forward 1 


TM 


N in 


607 


678 


forward 1 


TM 


N in 


1009 


1080 


forward 1 


TM 


N in 


497 


583 


forward 2 


TM 


N out 


743 


829 


forward 2 


TM 


N out 


1026 


1085 


forward 3 


TM 


Nout 


76 


162 


forward 1 


TM 


N out 


190 


276 


forward 1 


TM 


N out 


1237 


1323 


forward 1 


TM 


N out 


68 


142 


forward ,2 


TM 


N in 


335 


412 


forward 2 


TM 


N in 


758 


844 


forward 2 


TM 


N in 


1229 


1288 


forward 2 


TM 


N in 


60 


146 


forward 3 


TM 


N in 


216 


302 


forward 3 


TM 


N in 


690 


752 


forward 3 


TM 


N in 


765 


827 


forward 3 


TM 


N in 


1209 


1289 


forward 3 


TM 


N in 


343 


399 


forward 1 


TM 


N in 


1078 


1131 


forward 1 


TM 


N in 


1606 


1692 


forward 1 


TM 


N fn 


2218 


2274 


forward 1 


TM 


N in 


2383 


2433 


forward 1 


TM 


N in 


110 


196 


forward 2 


TM 


N in 


1307 


1378 


forward 2 


TM 


N in 


1640 


1726 


forward 2 


TM- 


Nin 


1946 


2005 


forward 2 


TM 


Nin 


135 


200 


forward 3 


TM 


Nin 


693 


752 


forward 3 


TM 


Nin 


777 


839 


forward 3 


TM 


Nin 


867 


929 


forward 3 


TM 


Nin 


1035 


1118 


forward 3 


TM 


N in 


1173 


1253 


forward 3 


TM 


Nin 


1572 


1658 


forward 3 


TM 


Nin 


2121 


2180 


forward 3 


TM 


N in 


2277 


2363 


forward 3 


TM 


Nin 


2400 


2456 


forward 3 


TM 


Nin 


343 


393 


forward 1 


TM 




646 


732 


forward 1 


TM 




1639 


1725 


forward 1 


TM 




1879 


1965 


forward 1 


TM 




194 " 
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8 


LG:220992.1 :2000M AY1 9 


2005 


8 


LG:220992.1 :2000MAY1 9 


17 


8 


LG:220992.1 :2000MAY1 9 


, 1646 


8 


LG:220992.1 :2000M AY1 9 


1850 


8 


LG:220992.1 :2000MAY1 9 


1434 


8 


LG:220992.1 :2000MAY1 9 


1734 


8 


LG-.220992.1 :2000MAY19 


1974 


8 


LG.220992.1 :2000MAY1 9 


2067 


8 


LG:220992.1 :2000MAY1 9 


2151 


9 


LG.1094571 .1 :2000MAY19 


781 


9 


LG:1 094571 .1 :2000MAY1 9 


419 


9 


LG:1 094571 .1 :2000MAY1 9 


767 


9 


LG:1 094571 .1 :2000MAY1 9 


756 


10 


LI:350754.4:2000MAY01 


277 


10 


LI:350754.4:2000MAY01 


583 


10 


LI:350754.4:2000MAY01 


670 


10 


LI:350754.4:2000MAY01 


381 


10 


LI:350754.4:2000MAY01 


2469 


12 


Ll:1 1 90263.1 :2000MAY01 


664 


12 


Ll:1 1 90263.1 :2000MAY01 


787 


12 


Ll:1 1 90263.1 :2000MAY01 


901 


12 


Ll:1 1 90263.1 .2000MAY01 


188 


12 


Ll:1 1 90263.1 :2000MAY01 


455 


12 


Ll:1 1 90263.1 :2000MAY01 


809 


12 


Ll:1 1 90263.1 :2000MAY01 


1616 


12 


Ll:1 1 90263.1 :2000MAY01 


183 


12 


Ll:1 1 90263.1 :2000MAY01 


648 


12 


Ll:1 1 90263.1 :2000MAY01 


1149 


13 


LG:270916.2:2000FEB18 


173 


14 


LG:999414.3:2000FEB18 


109 


14 


LG:99941 4.3:2000FEB1 8 


358 


14 


LG:99941 4.3:2000FEB1 8 


520 


14 


LG:99941 4.3:2000FEB1 8 


661 


14 


LG:99941 4.3:2000FEB1 8 


883 


14 


LG:999414.3:2000FEB18 


976 


14 


LG:99941 4.3:2000FEB1 8 


302 


14 


LG:99941 4.3:2000FEB1 8 


533 


14 


LG:999414.3:2000FEB18 


992 


14 


LG:99941 4.3:2000FEB1 8 


1169 


14 


LG:99941 4.3:2000FEB1 8 


1307 


14 


LG:99941 4.3:2000FEB1 8 


207 


14 


LG:999414.3:2000FEB18 


324 


14 


LG:99941 4.3:2000FEB1 8 


540 


14 


LG:99941 4.3:2000FEB1 8 


1029 


14 


LG:99941 4.3:2000FEB1 8 


1167 


14 


LG:99941 4.3:2000FEB1 8 


1314 


15 


LG:429446.1 :2000FEB1 8 


628 


15 


LG:429446.1 '.2000FEB1 8 


629 


15 


LG:429446.1 :2000FEB1 8 


627 


16 


LI:057229.1:2000FEB01 


10 


16 


Ll:057229.1 :2000FEB01 


118 


16 


Ll:057229.1 :2000FEB01 


292 


16 


Ll:057229.1 :2000FEB01 


11 


16 


Ll:057229.1 :2000FEB01 


146 


16 


Ll:057229;1 :2000FEB01 


290 


16 


Ll:057229.1 :2000FEB01 


12 






T95 



) 



PCT/US01/06059 



2088 


forward 1 


TM 




76 


forward 2 


TM 


Nin 


1732 


forward 2 


TM 


Nin 


1933 


forward 2 


TM 


Nin 


1484 


forward 3 


TM 


Nout 


1820 


forward 3 


TM 


Nout 


2036 


forward 3 


TM 


Nout 


2129 


forward 3 


TM 


Nout 


2237 


forward 3 


TM 


Nout 


867 


forward 1 


TM 


Nin 


505 


forward 2 


TM 


Nin 


853 


forward 2 


TM 


Nin 


842 


forward 3 


TM 


Nin 


348 


forward 1 


TM 


Nin 


651 


forward 1 


TM 


Nin 


747 


forward 1 


TM 


Nin 


467 


forward 3 


TM 


Nin 


2555 


forward 3 


TM 


Nin 


735 


forward 1 


TM 


Nin 


861 


forward 1 


TM 


Nin 


954 


forward 1 


TM 


Nin 


274 


forward 2 


TM 


Nin 


508 


forward 2 


TM 


N in 


895 


forward 2 


TM 


Nin 


1663 


forward 2 


TM 


Nin 


251 


forward 3 


TM 


Nin 


704 


forward 3 


TM 


N in 


1235 


forward 3 


TM 


Nin 


259 


forward 2 


TM 


Nout 


195 


forward 1 


TM 


Nout 


438 


forward 1 


TM 


N out 


591 


forward 1 


TM 


Nout 


744 


forward 1 


TM 


N out 


969 


forward 1 


TM 


Nout 


1062 


forward 1 


TM 


Nout 


388 


forward 2 


TM 


Nin 


613 


forward 2 


TM 


Nin 


1048 , 


forward 2 


TM 


Nin 


1246 


forward 2 


TM 


Nin 


1366 


forward 2 


TM 


Nin 


284 


forward 3 


TM 


Nout 


404 


forward 3 


TM 


Nout 


599 


forward 3 


TM 


Nout 


1115 


forward 3 


TM 


Nout 


1253 


forward 3 


TM 


Nout 


1373 


forward 3 


TM 


Nout 


699 


forward 1 


TM 


Nout 


682 


forward 2 


TM 


N in 


713 


forward 3 


TM 


Nin 


69 


forward 1 


TM 




198 


forward 1 


TM 




360 


forward 1 


TM 




67 


forwards 


TM 




226 


forward 2 


TM 




355 


forward 2 


TM 




71 


forward 3 


TM 


Nout 



I 



WO 01/62927 



16 


Ll:057229.1 :2000FEB01 


114 


17 


Ll:351 965.1 :2000FEB01 


487 


17 


U:351 965.1 :2000FEB0T 


1036 


17 


Ll:351 965.1 :2000FEB01 


492 


17 


Ll:351 965.1 :2000FEB01 


969 


17 


• Ll:351 965.1 :2000FEB01 


1098 


18 


LG:068682.1 :2000FEB18 


707 


19 


LG:242665. 1 :2000FEB1 8 


10 


19 


LG:242665.1:2000FEB18 


12 


19 


LG:242665.1 :2000FEB18 


333 


20 


LG:241 743.1 :2000FEB1 8 


43 


21 


Ll:034212.1 :2000FEB01 


1300 


21 


Ll:03421 2.1 :2000FEB01 


1570 


21 


Ll:034212.1 :2000FEB01 


2386 


21 


LJ.034212.1 :2000FEB01 


2533 


21 


Ll:034212.1 :2000FEB01 


2620 


21 


Ll:03421 2.1 :2000FEB01 


2740 


21 


Ll:034212.1 :2000FEB01 


719 


21 


Ll:03421 2.1 :2000FEB01 


1205 


21 


Ll:03421 2.1 :2000FEB01 


1460 


21 


Ll:03421 2.1 :2000FEB01 


1685 


21 


U:03421 2.1 :2000FEB01 


1814 


21 


(.1:034212.1 :2000FEB01 


2066 


21 


Ll:03421 2.1 :2000FEB01 


2156 


21 


Ll:034212.1 :2000FEB01 


2540 


21 


Ll:03421 2.1 -2000FEB01 


2657 


21 


Ll:03421 2.1 :2000FEB01 


12 


21 


Ll:034212.1 :2000FEB01 


1236 


21 


Ll:034212.1 :2000FEB01 


1590 


21 


Ll:03421 2.1 :2000FEB01 


1668 


21 


Ll'034212 1 2000FEB01 


2130 


21 


LI03421 2 1 -2000FEB01 


2295 


21 


LI034212.1 :2000FEB01 


2436 


21 


Ll'034212 V2000FEB01 


2538 


21 


Ll*034212 V2000FEB01 


2667 


22 


LG'344886 1*2000MAY19 


937 


22 


LG"344886 V2000MAY19 

fc»V^«w I I WWW. 1 »fc-WVWI»l#» ■ 1 W 


1081 


22 


LG:344886.1 :2000MAY19 


1696 


22 


LG:344886. 1 :2000MAY1 9 


413 


22 


LG:344886.1 :2000MAY19 


551 


22 


LG:344886.1 :2000MAY19 


950 


22 


LG:344886.1 :2000MAY19 


1031 


22 


LG:344886.1 :2000MAY19 


1112 


22 


LG:344886.1 :2000MAY19 


1271 


22 


LG:344886,1 :2000MAY19 


1634 


22 


LG:344886. 1 :2000MAY1 9 


567 


22 


LG:344886. 1 :2000MAY1 9 


1011 


22 


LG:344886.1 :2000MAY19 


1089 


22 


LG:344886.1 :2000MAY19 


1707 


23 


LG:228930.1:2000MAY19 


111 


24 


LG:338927.1 :2000MAY19 


934 


24 


LG:338927.1:2000MAY19 


1133 


24 


LG:338927.1 :2000MAY19 


1170 


25 


LG:898771 .1 :2000MAY19 


1261 


25 


LG:898771 . 1 :2000MAY1 9 


1397 


26 


LI:257664.67:2000MAY01 


280 




i 


196 



PCT/US01/06059 



176 


forward 3 


TM 


Nout 


573 


forward 1 


TM 




1098 


forward 1 


TM 




578 


forward 3 


TM 


Nin 


1055 


forward 3 


TM 


Nin 


1184 


forward 3 


TM 


Nin 


793 


forward 2 


TM 


Nout 


63 


forward 1 


TM 


Nout 


62 


forward 3 


TM 


Nout 


398 


, forward 3 


TM ■ 


Nout 


99 


forward 1 


TM 


Nout 


1365 


■ forward 1 


TM 


Nin 


1647 


forward 1 


TM 


Nin 


2472 


forward 1 


TM 


Nin 


2598 


forward 1 


TM 


Nin 


2706 


forward 1 


TM i 


Nin 


2826 


forward 1 


• TM 


N in 


805 


forward 2 


TM 




1291 


forward 2 


TM 




1546 


• forward 2 


TM 




1768 


forward 2 


TM 




1882 


forward 2 


TMt 




2128 


forward 2 


TM 




2218 


forward 2 


TM 




2626 


forward 2 


TM 




2734 


forward 2 


TM 




62 


forward 3 


TM 


Nout 


1301 


forward 3 


TM 


N out 


1646 


forward 3 


TM 


Nout 


1721 


forward 3 


TM 


N out 


2216 


forward 3 


TM 


N out 


2381 


forward 3 


TM 


N out 


2513 


forward 3 


TM 


N out 


2624 


forward 3 

1WI VIUIU w 


TM 


N out 


2735 


forward 3 


TM 


Nout 


1002 


forward 1 


TM 


N in 


1155 


forward 1 

1 WI VV Ml W 1 


TM 


N in 


1782 


forward 1 


TM 


N in 


463 


forward 2 


TM 


N in 


637 


forward 2 


TM 


N in 


1012 


forward 2 


TM 


Nin 


1093 


forward 2 


TM 


Nin 


1183 


forward 2 


TM 


N in 


1348 


forward 2 


TM 


Nin 


1720 


forward 2 


TM 


Nin 


626 


forward 3 


TM 


Nin 


1073 


forward 3 


TM 


Nin 


1151 


forward 3 


TM 


Nin 


1757 


forward 3 


TM 


N in 


167 


forward 3 


TM 


Nin 


1020 


forward 1 


TM 


Nout 


1219 


forward 2 


TM 


Nin 


1250 


forward 3 


TM 


Nin 


1314 


forward 1 


TM 


Nout 


1450 


forward 2 


TM 


Nout 


366 


forward 1 


TM 


Nin 



I 
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26 


LI:257664.67:2000MAY01 


26 


LI:257664.67:2000MAY01 


27 


LI.001 496.2.2000MAY01 


28 


Ll:1 085273.2:2000MAY01 


28 


' LI:1085273.2:2000MAY01 


28 


L(:1 085273.2:2000MAY01 


28 


LI: 1 085273.2:2000MAY01 


29 


Ll:3331 38.2-.2000MAY01 


29 


Ll:3331 38.2:2000MAY01 


29 


Ll:3331 38.2.2000MAY01 


29 


Ll:3331 38.2:2000MAY01 


29 


Ll:3331 38.2:2000MAY01 


29 


Ll:3331 38.2:2000MAY01 


29 


LI.3331 38.2:2000MAY01 


30 


Ll:338927.1 :2000MAY01 


30 


Ll:338927.1 :2000MAY01 


30 


Ll:338927.1 :2000MAY01 


30 


Ll:338927.1 :2000MAY01 


31 


LG:335558.1 :2000FEB18 


31 


LG:335558.1 :2000FEB18 


31 


LG:335558.1 :2000FEB18 


31 


LG:335558.1 :2000FEB18 


31 


LG:335558.1 :2000FEB18 


31 


LG:335558.1 :2000FEB18 


32 


LG:998283.7:2000FEB18 


32 


LG:998283.7:2000FEB1 8 


32 


LG:998283.7:2000FEB18 


32 


LG:998283.7:2000FEB18 


32 


LG:998283.7:2000FEB18 


32 


LG:998283.7:2000FEB1 8 


32 


LG:998283.7:2000FEB18 


32 


LG:998283.7:2000FEB1 8 


32 


LG:998283.7:2000FEB18 


32 


LG:998283.7:2000FEB1 8 


32 


LG:998283.7:2000FEB1 8 


33 


Ll:402739.1 :2000FEB01 


35 


LG:981 076.2:2000M AY1 9 


35 
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TABLE 5 

SEQ ID NO: Template ID 

1 LG:1040582.1 :2000FEB18 

2 LG:453570.1 :2000FEB18 
3' LG:408751.3:2000FEB18 

' 4 LI:090574.1:2000FEB01 

5 LI:229932.2:2000FEB01 

6 LI:332176.1:2000FEB01 

7 LI:403248.2:2000FEB01 

8 LG:220992.1:2000MAY19 

9 LG:1 094571 .1 :2000MAY1 9 

1 0 LI:350754.4:2000MAY01 

1 1 LI:255828.29:2000MAY01 

12 Ll:11 90263.1 :2000MAY01 

1 3 LG:27091 6.2:2000FEB1 8 



1 4 LG:99941 4.3:2000FEB1 8 

15 LG:429446.1:2000FEB18 

16 LI:057229.1:2000FEB01 

17 Ll:351 965.1 :2000FEB01 

1 8 LG:068682.1 :2000FEB1 8 

1 9 LG:242665. 1 :2000FEB1 8 

20 LG:241743.1 :2000FEB18 

21 LI:034212.1:2000FEB01 

22 LG:344886. 1 :2000MAY1 9 

23 LG:228930. 1 :2000M AY1 9 

24 LG:338927.1:2000MAY19 

25 LG:898771 .1:2000MAY19 

26 LI:257664.67:2000MAY01 

27 Ll:001 496.2:2000MAY01 

28 Ll:1 085273.2:2000MAY01 

29 Ll:3331 38.2:2000MAY01 

30 LI:338927.1:2000MAY01 

31 LG:335558.1 :2000FEB18 



Tissue Distribution . 

liver - 41%, Pancreas - 34%, Cardiovascular System 
-14% 

Nervous System - 1 00% 

Sense Organs - 63%, Nervous System - 22% 

Nervous System - 46%, Unclassified/Mixed - 36% 

Musculoskeletal System - 80% 

Urinary Tract -95% 

Respiratory System - 60%, Hemic and Immune 
System -40% 

Embryonic Structures - 17%, Male Genitalia - 12% 
Liver - 19%, Embryonic Structures - 16%, 
Cardiovascular System - 14% 
Skin - 47%, Stomatognathic System - 27%, Sense 
Organs- 14% 

Musculoskeletal System - 1 00% 

Urinary Tract - 80%, Urinary Tract - 15% 

Female Genitalia - 100% 

Embryonic Structures - 30%, Urinary Tract - 1 3%, 

Digestive System - 11%, Musculoskeletal System - 

11% 

Urinary Tract - 80%, Hemic and Immune System - 
20% 

Male Genitalia - 71%, Hemic and Immune System - 
29% 

Unclassified/Mixed - 53%, Male Genitalia - 12% 
Unclassified/Mixed - 49%, Male Genitalia - 27% 
Germ Cells - 47%, Female Genitalia - 13%, Male 
Genitalia- 12% 

Liver - 27%, Urinary Tract - 27%, Respiratory System - 
14% 

Digestive System - 24%, Musculoskeletal System - 

22%, Nervous System - 1 1% 

Germ Cells - 24%, Nervous System - 12% 

Embryonic Structures - 43%, Nervous System - 29%, 

Respiratory System - 14%, Hemic and Immune 

Svstem-14% 

Digestive System - 23%, Unclassified/Mixed - 21%, 
Embryonic Structures - 19%, Hemic and Immune 
Svstem-19% 

Pancreas - 13%, Embryonic Structures - 1 1%, 
Female Genitalia - 10%, Urinary Tract - 10%, Hemic 
and Immune System - 10%, Cardiovascular System - 
10% 

Hemic and Immune System - 1 00% 

Endocrine System - 27%, Female Genitalia - 25%, 

Embryonic Structures - 25% 

Digestive System - 29%, Skin - 24%, Endocrine 

System- 16% 

Exocrine Glands - 61%, Nervous System - 13%, 
Nervous System - 11% 

Embryonic Structures - 51%, Digestive System - 17% 



i 



Endocrine System - 45%, Nervous System • 
Exocrine Glands - 11% 
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CLAIMS 

What is claimed is: 

1 . An isolated polynucleotide comprising a polynucleotide sequence selected from the group 
5 consisting of: 

a) a polynucleotide sequence selected from the group consisting of SEQ ID NO:l-21 1, 

b) a naturally occurring polynucleotide sequence having at least 90% sequence identity to a 
polynucleotide sequence selected from the group consisting of SEQ ID NO:l-21 1, 

c) a polynucleotide sequence complementary to a), 1 
10 d) a polynucleotide sequence complementary to b), and 

e) an RNA equivalent of a) through d). 

2. An isolated polynucleotide of claim 1, comprising a polynucleotide sequence selected from 
the group consisting of SEQ ID NO:l-211. 

15 

3. An isolated polynucleotide comprising at least 60 contiguous nucleotides of a polynucleotide 
of claim 1. 

4. A composition for the detection of expression of diagnostic and therapeutic polynucleotides 
20 comprising at least one of the polynucleotides of claim 1 and a detectable label. 

5. A method for detecting a target polynucleotide in a sample, said target polynucleotide 
having a sequence of a polynucleotide of claim 1, the method comprising: 

a) amplifying said target polynucleotide or fragment thereof using polymerase chain reaction 
25 amplification, and 

b) detecting the presence or absence of said amplified target polynucleotide or fragment 
thereof, and, optionally, if present, the amount thereof. 



6. A method for detecting a target polynucleotide in a sample, said target polynucleotide 
3 o comprising a sequence of a polynucleotide of claim 1 , the method comprising: 

a) hybridizing the sample with a probe comprising at least 20 contiguous nucleotides 
comprising a sequence complementary to said target polynucleotide in the sample, and which probe 
specifically hybridizes to said target polynucleotide, under conditions whereby a hybridization complex 
is formed between said probe and said target polynucleotide or fragments thereof, and 
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b) detecting the presence or absence of said hybridization complex, and, optionally, if present, 
the amount thereof. 

7. A method of claim 5, wherein the probe comprises at least 30 contiguous nucleotides. 

5 

8. A method of claim 5, wherein the probe comprises at least 60 contiguous nucleotides. 

9. A recombinant polynucleotide comprising a promote sequence operably linked to a 
polynucleotide of claim 1. 

10 

10. A cell transformed with a recombinant polynucleotide of claim 9. 

1 1 . A transgenic organism comprising a recombinant polynucleotide of claim 9. 

15 12. A method for producing a diagnostic and therapeutic polypeptide, the method comprising: 

a) culturing a cell undo: conditions suitable for expression of the diagnostic and therapeutic 
polypeptide, wherein said cell is transformed with a recombinant polynucleotide of claim 9, and 

b) recovering the diagnostic and therapeutic polypeptide so expressed. 

20 13. A purified diagnostic and therapeutic polypeptide (DITHP) encoded by at least one of the 

polynucleotides of claim 2. 

14. An isolated antibody which specifically binds to a diagnostic and therapeutic polypeptide 
of claim 13. 

25 

15. A method of identifying a test compound which specifically binds to the diagnostic and 
therapeutic polypeptide of claim 1 3, the method comprising the steps of: 

a) providing a test compound; 

b) combining the diagnostic and therapeutic polypeptide with the test compound for a 
3 o sufficient time and under suitable conditions for binding; and 

c) detecting binding of the diagnostic and therapeutic polypeptide to the test compound, 
thereby identifying the test compound which specifically binds the diagnostic and therapeutic 
polypeptide. 
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16. A microarray wherein at least one element of the microarray is a polynucleotide of claim 3. 



17. A method for generating a transcript image of a sample which contains polynucleotides, 
the method comprising the steps of: 

a) labeling the polynucleotides of the sample, 

b) contacting the elements of the microarray of claim 1 6 with the labeled polynucleotides of the 
sample undo* conditions suitable for the formation of a hybridization complex, and 

c) quantifying the expression of the polynucleotides in the sample. 



18. A method for screening a compound for effectiveness in altering expression of a target 
polynucleotide, wherein said target polynucleotide comprises a polynucleotide sequence of claim 1, the 
method comprising: 

, a) exposing a sample comprising the target polynucleotide to a compound, under conditions 

15 suitable for the expression of the target polynucleotide, 

b) detecting altered expression of the target polynucleotide, and 

c) comparing the expression of the target polynucleotide in the presence of varying amounts of 
the compound and in the absence of the compound. 

20 19. A method for assessing toxicity of a test compound, said method comprising: 

a) treating a biological sample containing nucleic acids with the test compound; 

b) hybridizing the nucleic acids of the treated biological sample with a probe comprising at 
least 20 contiguous nucleotides of a polynucleotide of claim 1 under conditions whereby a specific 
hybridization complex is formed between said probe and a target polynucleotide in the biological 

2 5 sample, said target polynucleotide comprising a polynucleotide sequence of a polynucleotide of claim 1 
or fragment thereof; 

c) quantifying the amount of hybridization complex; and 

d) comparing the amount of hybridization complex in the treated biological sample with the 
amount of hybridization complex in an untreated biological sample, wherein a difference in the amount 

30 of hybridization complex in the treated biological sample is indicative of toxicity of the test compound. 

20. An array comprising different nucleotide molecules affixed in distinct physical locations on 
a solid substrate, wherein at least one of said nucleotide molecules comprises a first oligonucleotide or 
polynucleotide sequence specifically hybridizable with at least 30 contiguous nucleotides of a target 
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polynucleotide, said target polynucleotide having a sequence of claim 1. 

21 . An array of claim 20, wherein said first oligonucleotide or polynucleotide sequence is 
completely complementary to at least 30 contiguous nucleotides of said target polynucleotide. 

22. An array of claim 20, wherein said first oligonucleotide or polynucleotide sequence is 
completely complementary to at least 60 contiguous nucleotides of said target polynucleotide 

23. An array of claim 20, which is a miaoarray. 

i 

24. An array of claim 20, further comprising said target polynucleotide hybridized to said first 
oligonucleotide or polynucleotide, 

25. An array of claim 20, wherein a linker joins at least one of said nucleotide' molecules to 
said solid substrata 

26. An array of claim 20, wherein each distinct physical location on the substrate contains 
multiple nucleotide molecules having the same sequence, and each distinct physical location on the * 
substrate contains nucleotide molecules having a sequence which differs from the sequence of 

2 o nucleotide molecules at another physical location on the substrate. 

27. An isolated polypeptide comprising an amino acid sequence selected from the group 
consisting of: 

a) an amino acid sequence selected from the group consisting of SEQ ID NO:21 2-422, 
25 b) a naturally occurring amino acid sequence having at least 90% sequence identity to an 

amino acid sequence selected from the group consisting of SEQ ID NO:21 2-422, 

c) a biologically active fragment of an amino acid sequence selected from the group 
consisting of SEQ ID NO:212-422, and 

d) an immunogenic fragment of an amino acid sequence selected from the group consisting 
30 of SEQIDNO:212-422. 



! 292 

I 
I 



10 



BNSDOCID: <WO 01 62927 A2_L> 



WO 01/62927 



PCT/US01/06059 



<110> INCYTE GENOMICS 
PANZER, Scott R. 
SPIRO, Peter ,A. 
BANVILLE, Steven C. 
SHAH, Purvi 
CHALUP, Michael's. 
CHANG, Simon C. 
CHEN, Alice 
D'SA, Steven A. 
AMSHEY, Stefan 
DAHL, Christopher R. 

DAM, Tarn C. ' 

DANIELS, Susan E . 

DUFOUR, Gerard E. 

FLORES, Vincent 

FONG, Willy T. 

GREENAWALT, Lila B. 

HILLMAtt, Jennifer L . 

JONES, Anissa L* # 

LIU, Tommy F. * 

ROSEBERRY Ann M. 

ROSEN, Bruce H. 

RUSSO, Frarik D . 

STOCKDREHER, Theresa K. 

DAFFO, Abel 

WRIGHT, Rachel J. 

YAP, Pierre E . 

YU, Jimmy Y. 

BRADLEY , Diana ,L . 

BRATCHER, Shawn R. 

CHEN, Wensheng 

COHEN, Howard J. 

HODGSON, David M. 

LINCOLN, Stephen E. 

<120> MOLECULES FOR DIAGNOSTICS AND THERAPEUTICS 

<130> PT-1113 PCT 



<140> To Be Assigned 
<141> Herewith 

<150> 60/184,777; 60/184,797; 60/184,698; 60/184,770; 

60/184,693; 60/184,771; 60/184,813; 60/184,773; 

60/184,769; 60/184,768; 60/184 , 837 ;. 60/184 , 697 ; 

60/184,772; 60/185,213; 60/185,216; 60/204,863; 

60/204,815; 60/203,785; 60/204,821; 60/204,908; 



60/204,525; 
60/205,324; 



60/205,285; 
60/205,286 



60/205,232; 60/205,323; 



60/184,774 
60/184,776 
60/184,841 
60/205,221 
60/204,226 
60/205,287 



2000- 


02 


-24; 


2000 


-02 


-24; 


2000- 


02 


-24; 


2000- 


02 


-24; 


2000- 


02- 


24; 


2000- 


02 


-24; 


2000 


-02 


-24; 


2000- 


02 


-24; 


2000- 


02 


-24; 


2000- 


02- 


24; 


2000- 


02 


-24; 


2000 


-02 


-24; 


2000- 


02 


-24; 


2000- 


02 


-24; 


2000- 


02- 


24; 


2000- 


02 


-24; 


2000 


-02 


-24; 


2000- 


02 


-24; 


2000- 


02 


-24; 


2000- 


05- 


17; 


2000- 


05 


-17; 


2000 


-05 


-12; 


2000- 


05 


-16; 


2000- 


05 


-16; 


2000- 


05- 


15; 


2000- 


05 


-16; 


2000 


-05 


-17; 


2000- 


05 


-16; 


2000- 


05 


-17; 


2000- 


05- 


17; 



2000-05-17; 2000-05-17 



<160> 422 

<170> PERL Program 



<210> 1 
<211> 543 



1/228 



BNSOOCID: <WO 0162927A2_I_> 



WO 01/62927 




PCT/US01/06059 



<212> DNA HB 

<213> Homo sapiefflT 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 1040582 .1 :2000FEB18 

i * 

<400> 1 

cggccgtcca gacgtcgtgg tggagtgaga ggttcccagg gaggaatgtg caaagtcctg 60 
ttggttctac ccctcttcta cttcgaggaa ggggtccgcg tcgcgcgttt ctggaacggg 120 
ctctctccct ggggtggctg ctgttctccg ccagattaag ccgggcctgc tggctcagta 180 
ctggagtgca gagctgaatt ctggccactt tgtcttctcc acagcctgtg ctcattgcca 240 
aggggacaat gacggcctcc agtgtcctcc tgcacactgg acagaagatg cctctgattg 300 
gtctggggac atggaagagt gagcctggtc aggtgaaagc agctattaaa tatgccctta 360 
gcgtaggcta ccgccacatt gactgtgctt ctgtatatrgg caatgaaact gagattgggg 420 
aggccctgaa ggagagtgtg gggtcaggca aggcagtccc tcgagaggag ctgtttgtga 480 
catccaagct gtggaatact aagcaccacc ctgaggatgt agaacctgcc ctccggaaga 540 
cac 543 

<210> 2 

<211> 633 

<212> DNA 

<213> Homo sapiens 

<220> 
1 <221> misc — feature 

<223> Incyte ID No: LG : 453570 . 1 :2000FEB18 

<400> 2 

cctgcacgct cctccgcctg attcccagcg cgtcgcactt caagagattc gatcgggttc 60 
' gccggttcgc gcccgccgcc atggcgactt cctcaggacc aaaggaggcg ccagcgaata 120 
accctgggct ccagaccgag gtcgaccccg ccaccaaagg ctacttcttg caacaaacga 180 
tgttacgtgt gaaggaccca aaagtgagcc ttgacttcta ctcacgtgtg atgggcatgt 240 
cgttgctgaa aaggttggat tttgaagaga tgaaattcag tttgtatttt ctaggttacg 300 
aggatgtgac cttagcccct gatgatcata tcaagcggac tgaatggact tttaggcaaa 360 
aagctactct tgagctcacc cacaactggg gcacggaaaa tgaccctgaa ttcaaaggtt 420 
accataatgg gaactcagac cctcgtggtt ttggtcatat aggagtaact gtggatgatg 480 
tccactaggc atgtgaacgc tttgaacgtc ttggggttga gtttgtgaag aaacctgatg 540 
atggaaaaat caatggcatc gctttcatca aagatcctga tggttatgga tcgaaatatt 600 
tgaccagacg attgggacgg ttacttcttc age 633 

<210> 3 

<211> 1674 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: LG:408751 . 3 :2000FEB18 



tagggaccca ggatggcaga tccgggaccg ggctgggctg gcttggaaca tgcttgccaa 60 
ctcagccagc gtgaggatcc tcatcaaggg aggcaaggtg gtgaacgatg actgcaccca 120 
cgaggctgac gtctacatcg agaatggcat catccagcag gtgggccgcg agctcatgat 180 
ccctggcggg gccaaggtga ttgatgecac aggaaaactg gtgatccctg gtggcatcga 240 
caccagcacc cacttccacc agaccttcat gaatgccacg tgcgtggacg acttctacca 300 
tgggaccaag gcagcactcg teggaggcac caccatgatc atcggccacg tcctgcccga 360 
caaggagacc tcccttgtgg aegcttatga gaagtgccga ggtctggccg accccaaggt 420 
ctgctgtgat tacgccctcc acgtggggat cacctggtgg gcacccaagg tgaaagcaga 480 
aatggagaca ctggtgaggg agaagggtgt caactcgttc cagatgttca tgacctacaa 540 
ggacctgtac atgettcgag acagtgagct gtaccaagtg ttgeacgett gcaaggacat 600 
tggggcaatc gcccgcgtcc atgctgaaaa tggggagctt gtggccgagg gtgctaagga 660 
ggcactggat ttggggatca caggcccaga aggaatcgag atcagccgtc cagaggagct 720 
ggaagctgaa gccactcatc gtgttatcac cattgeaaac aggactcact gtccaatcta 780 
cctggtcaac gtgtccagta teteggctgg tgacgttatc geagctgeta agatgeaagg 840 
gaaggttgtg ctggcggaga ccaccactgc acatgccacg ctgacaggct tacactacta 900 
ccaccaggac tggtcccacg cggctgccta tgtcacggtg cctcccctga gactggacac 960 



<400> 3 



2/228 



BNSDOCiO: <WO 0162927A2_I_> 



WO 01/62927 



caacacctca acctacctca^^agcctgct 

cagggatgtg tacatcttta ggaagacgtc 
ggtttgattt cattgtgcpc ataggatcat 
agataattgc catcctattg ggattttatg 
gtataagagt gaagagcaca tatagaactg 
cattgatctc ttactgctaa agcccagtgg 
ctagaatgta ' ggggtccttt ctgtttcttt 
cgtgtgcccc ttgtctgtac atccctcctt 
tccctttttc ttctgcactt tctccccctt 
ttctccagct cttctatttc tgacactgct 
atggtttgat gaacggtctg aattcagtgg 
cccaataact gcttgcagct tctttctctc 

<210> 4 

<211> 1248 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_f eature 

<223> Incyte ID No: LI : 09 0574 . 1 : 2000FEB01 



ggccaagtaa 
atagagggcc 
tgtgcttttc 
accgcttgat 
tgaaccgtgg 
ggagctgcag 
tgttcacctc 
ctgtgggttt 
tccttcccac 
ttttccccat 
gacccaatta 
tcctattaac 



PCT/US01/06059 



ggcg 1 1 1 cag^^Sgcac a t tg 

caggaaacaa atctgagcta 
ttaccttctc tgagcccatc 
atggagtgag atcttctgaa 
ttattcggaa tttcaaagga 
aacgtggcct gtgacttatc 
tccctcttct tgttggctcc 
ctcctccttt ccttcccctt 
acccttcctc cagatctcct 
gcaatcccaa tctaaaggaa 
tggaagggtt tctgttttat 
atttttgggg ccag , 



<220> 

<221> unsure 
<222> 1148, 1234 
<223> a, t, c, g, 



or other 



<400> 4 

cgcgtacgta 

gctggggata 

atggtgatca 

tccgaccact 

attccttcaa 

tcactggaag 

gctccgagca 

attcagacaa 

tgggagtgtt 

tggattccat 

tgcttccacc 

ttgagagtgt 

ccaaatttcg 

acttggccac 

ctgggtccca 

gagaagtatt 

taggcataaa 

atgccaaggc 

ttgttataag 

aggtgggntg 

acccattaat 



agctcggaat 
ccgcgagcac 
gcaatctcca 
tagtatcaag 
tgttgacttt 
ctacaggtta 
catagtagat 
ataccccagc 
tttacagatt 
taaagaaaag 
atcctgggac 
cacatggatt 
cagtctcctg 
cattccctgt 
aactgaatcc 
cattaacagt 
ataaggagaa 
tgcttgaacg 
ttttccacaa 
gactagatca 
taaaagcttt 



tcggctcgag 
aacggtccta 
attgagatta 
tatgacccaa 
gatgacacag 
cggcaggttc 
ggagtgagct 
tttgttgagg 
ggtgaaccta 
ggtaaacaaa 
tactggacat 
gttttaaagc 
tgcacagcgg 
gggagcagct 
actgggcctt 
aaacattagt 
gtggcctcgg 
ttggattact 
gtggcataga 
agggtcttca 
ttaatcaaaa 



ttccaccccg 
ttcactggaa 
aaaccaaaga 
get cage taa 
agaacaaatc 
accttcactg 
atgetgeaga 
cagctcatga 
attcccaact 
ctcgattcac 
atcctggttc 
aacctataaa 
agggtgaagc 
ctgccacctg 
gettttgetg 
tgtgctcagc 
gtgaaagcag 
cttcacctat 
gtctcttctg 
gcagttttga 
gctntcttac 



agggaccatg 
ggaatttttc 
agtgaaatat 
aatcatcagc 
agttctgcgt 
ggggtccget 
gctccatgtt 
accagatgga 
gcaaaagatt 
aaattttgac 
tcttacagtt 
catcagctct 
agcagctttt 
ctctgggcat 
ctgctaggga 
cctctgcagt 
gtttggatta 
acttctataa 
tgaacactgt 
teagecagtg 
aaagggcg 



tcgaggctca 
ectattgetg 
gactcttccc 
aacagcggcc 
ggtggtcctc 
gatgaccacg 
gttcactgga 
ctggctgtct 
actgacactt 
ctattgtctc 
ccacctcttc 
caacagctgg 
ctggtgcatc 
tgttaatcct 
agagaaggca 
ctacctctgc 
aaatttccta 
atgtgtaatc 
tcttctgtgc 
gatttcaaag 



<210> 5 

<211> 1451 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> Incyte ID No: 



LI: 22 9 932 .2:2000FEB01 



1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1674 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1248 



<400> 5 

taatgattca 

ttttgaccag 

ttattgattt 

cgctgtcatg 

gtgggcagct 

tcttcatggg 

egcttattte 



aagataattt 
gaacttttaa 
taatttcact 
ctacactggt 
tctgacttag 
aacacaacag 
cgtgtgcttg 



tataaaagee 
agtggatgct 
ttttctctcc 
caatgtcttc 
gctgggttgt 
ttttatatga 
-cagagcatgg 



aaagtaactt 
atgaagatag 
ttagggtgtg 
catataegga 
tggacattcc 

ggggaagect 

agtagctgee 



gtagctaata 
agagtgtttt 
attaggecca 
cttcaacccg 
tatatctget 
gtgggaacac 
ttgtttacag 



tataaaatac €0 
tacacattgt 120 
ctgggggata 180 
gagaggtgtg 240 
atggacctct 300 
cagatgetgg 3€0 
caccaactgc 420 



3/228 



BNSDOCID: <WO 01 62927A2_I_> 



WO 01/62927 



itratttc 



aattagagca atc< 
aaggttcaaa acattStttg 
caaaaatgtc ttcagagtac 
aattactgcg tcatgtgttg 
aaaaagcgcc caggatacaa 
ggtgtttagg aaatattgtg 
tggaagaatc aggaagcatt 
accatggatg ctggttacat 
gatgtaataa atgttgcagg 
tcccatggta ccgtggcaga 
gtccccttag cactctgtgt 
gaagaaattg tgaaacacgt 
gtgtttgtca aacagctacc 
gccattgtca atggcaagcc 
ggccacgtag aagaaatgct 
tgatttaatt tcttaattga 
agtagaaatt acttgcaaaa 
gaaaacatgt g 

<210> 6 

<211> 2615 

<212> DNA 

<213> Homo sapiens 



aggaccctgg 
tggctggaga 
ctgtcttaga 
gattaggcaa 
tgttatgatt 
gtaaaagtta 
caagcattta 
ggatgaagaa 
tcacagaatt 
ctgtgctgtt 
attgagaaaa 
tagacagaac 
caaaaccaga 
atacaagata 
gaagcaagca 
aattaaatta 
tgaaatgtga 



ggcagctttg 
acgatgtgat 
ccattggtgg 
ttctaaaaca 
ttggatgaca 
,ccattgccac 
tactttgaaa 
.ggctatttgt 
tctgcaggcg 
gttggcaagg 
gatataaatg 
attggccctg 
tctggcaaga 
acttctacaa 
taatgagttt 
tttgagttgt 
attgtaaaac 



agagac 



PCT/US01/06059 



ggg^pagt 

gtagagaccc 
caaactgaga 
cctccaccag 
acatgcaaaa 
ctggggcttt 
aatttcctgg 
atgttatgtc 
ccattgaaga 
aagatccctt 
caacagagga 
tggctgcttt 
tcccccgatc 
ttgaagaccc 
gtcttattcc 
ttctcagaaa 
ttggcctaaa 



actctctgac 
tggaatggtc 
ctggatctcc 
ggcaagcagg 
ctgaaggctc 
ttcaggactc 
atattatgat 
tcgagtggat 
gtcaatcctt 
aaaaggtcat 
gcaagttttg 
tcgaaatgca 
agctttatct 
cagcattttt 
tattttgagt 
taaatatttc 
caaatctaat 



480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1451 



<220> 

<221> misc_feature 
<223>Incyte ID No: 



LI: 332176. 1:2000FEB01 



<400> 6 
actgataaaa 
tggatcacaa 
' gaaaatttct 
tgacataggc 
tggagcctac 
ctggctcttt 
cctcctgcca 
gaggcctctg 
catgctgggg 
tgtgtttctt 
aggagggtgt 
tggaagtgtg 
ctcctatgga 
tgatgatggc 
attctgttcc 
gtgtgtggtg 
ccactcatct 
cctgagcctg 
acaaagaact 
tcaccttttt 
cattcgtcac 
atgttataga 
tagccctgag 
ctgcgtgggt 
gcctcaggcc 
tccaagtgga 
gcagagcagc 
tagagatcga 
gtttctaatt 
actaaccaaa 
ccatgtaccc 
gggagaccct 
taagtaatat 
gaacaaaata 
cccaggctct 
gtgcccaagg 
cagcctcgac 
agcaacttgc 



cgatctaaag 
aagtatggtg 
atcccaatgt 
gggttcattg 
cagcccttct 
ggggaggaac 
tattggtatt 
tgggtagagt 
agtgcattat 
ccaggatcaa 
actgtaaaga 
ataccaataa 
ctccgggttg 
cattcattcc 
agtgttctga 
gagaagatct 
gatggtaaag 
gagaagctct 
gcccctggtg 
tgagattttt 
cattatacta 
tgtactaaca 
acatttatag 
ctgttttaac 
atgcagcatt 
cagcagcctc 
cctcttctgt 
cttggcagca 
aatcttaaaa 
taatatttat 
accactggac 
ggccttgggc 
ttcaatccaa 
ttaaaattta 
aatatggctt 
ctccacagac 
tcctcagcac 
taacggccac 



ggaaggagag 
ccgtgtggac 
tactcactct 
ggaatccaga 
tccgtggcca 
acacccgact 
ctctgttcta 
tccctgatga 
gggttcatcc 
atgaggtctg 
tcccagtagc 
agacaactgt 
ctctaagcac 
aatacctcca 
tcaatagttt 
tggtcttagg 
atcagcctgt 
cactcaacat 
atgtgagcag 
gctgcaatct 
atgaacaata 
atattccttg 
cgttcaggag 
gtgggccaag 
ggcgctctgg 
tggtactccc 
gtgcactgca 
cgaaggattc 
tccatgtctt 
aacatgagta 
ttcagaagta 
acaaaatgta 
tatttttaaa 
aaataaagac 
ggcatggggc 
ctcagtggct 
tcctcagcac 
ctctgtgcct 



accctttgtt 
aggcgacaac 
cagcattact 
gacagagctg 
tgccaccatg 
catccgagaa 
ccatgcacac 
actaaagact 
agtcacagaa 
gtatgactat 
cttggacact 
aggaaaatcc 
taagggttct 
ccagaagcaa 
tgctgaccag 
cttcaggaag 
ggcttttacg 
tgccactgac 
ggacctgcct 
gtttgccttc 
gatttcatgt 
tatcaaacat 
tcttctattg 
cctacctggg 
ctgcagcagc 
cccatttatc 
tacgctgcag 
ttttctcttt 
ttacattgtt 
agctataatt 
gaactcatga 
agggatgcca 
aatcagaatt 
aggattagta 
agaacattac 
ccctgctgcc 
acacctcttc 
tctgatccct 



cttacacgtt 
acagcagaat 
gggatctctt 
ctagtgcgtt 
aacaccaagc 
gccatcagag 
gtggcttccc 
tttgatatgg 
ccaaaagcca 
aagacatttg 
attccagtgt 
acaggctgga 
tcagtgggtg 
tttttgcaca 
aggggtcatt 
gagccatctt 
tattgtgcca 
tgggaggtcc 
gcccctttca 
cctgaatcaa 
ttcaaaattt 
ctccttttct 
cttccattcc 
cagcccattt 
tgagttgctc 
ttccacccac 
ccgtgggagt 
catgcttctc 
tttttaatta 
aataacaatg 
ctgggactag 
aaaaaataca 
aatgcaaaaa 
ttactgagtt 
aacataccag 
tgccacagca 
ttatcaggct 
gggcgccaat 



ctttctttgc 
ggagcaactt 
tttgcggagc 
ggtaccaggc 
gacgagagcc 
agcgctatgg 
aacctgtcat 
aagatgaata 
ccacagttga 
ctcatgggga 
ttcagcgagg 
tgactgaatc 
agttatatct 
ggaagttttc 
atcccagcaa 
ctgtgactac 
aaacatccat 
gcatcatatg 
acctttcccc 
aataatcttt 
cagattttac 
ccctgataca 
ttcagcaggg 
gccagggctt 
aaggccagtg 
atggagctgg 
tattctcccc 
aggctcaata 
agtgctgttt 
aaataaatac 
gatgaggcaa 
gtaatcaaag 
aaaccatgat 
ttccttttgt 
tcgtgtcatg 
tctgttttag 
tcctccactt 
atcctcctgc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 



4/228 



BNSDOCID: <WO 0162927A2_I_> 



WO 01/62927 



ccttaccatc cttccaggcc||^acttaaat 

acacccctac ccccataccc attagcagtg 
ttataactgt gctctactta gcattctcag 
gcgttctcct actagattgt atgtccctca 
agagcactat tatacctgac tttcagtaac 
taattgatta aaaaattacg attgaatgta 

<210> 7 

<211> 491 

<212> DNA 

<213> Homo sapiens 



cccactttcc 
attttgccct 
ggatcatacc 
agagcatgtt 
! tgttagctgt 
aaaaa 



catgaagcc 

tccccgtaat 

.ttaatgtttt 

ctgtttctct 

gattagttag 



PCT/US01/06059 

ctgcgtga 2340 
gctgtcccac 2400 
cagtatgtct 2460 
tctgtctgac 2520 
ctggtggatt 2580 
2615 



<220> 

<221> mis cofeature 
<223> Incyte ID No: 



LI: 403248. 2 :200 0FEB01 



<400> 7 

gcctggcagg 

ggcgtgtatg 

cagaagtcta 

gtgtagcgct 

cccggcggtt 

aggggactgg 

ttgggggacc 

gcgaaggcag 

gtcctgcggc 



aggaggaaaa 
ggatcttcaa 
agaaccactc 
cttttgaaac 
gatggctgtg 
ctggtctttg 
caggcctgcc 
ctgatggctg 
t 



tggttccacc 
ctcagtcctg 
accctgctac 
catccacgga 
attgcgtggc 
acaacatggg 
acatcaccta 
cagaacagga 



tccaagacca 
tttgacaaca 
atgtctctag 
gcagcccctg 
tgagggcctg 
cgcctacact 
tgccatgtcc 
ggatgacgtg 



tcgtgtacca 
tctgccctac 
agtccattca 
tacagcagca 
tggctgccgc 
gtgggcatgg 
cgggtggcct 
gagggtgtgt 



ccttgatgag 60 
ccccatcctg 120 
tttcattgcc 180 
gcctgtgggg 240 
aactacacgt 300 
gttccccctt 360 
gggaagcgct 420 
gcaagcctct 480 
491 



<210> 8 

<211> 2309 

,<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 220992 . 1 : 2000MAY19 



<220> 

<221> unsure 
<222> 1738 
<223> a, t, c, 



g, or other 



<400> 8 

aattaaagat 

tgcctggctc 

aagtttctgc 

tctctcttat 

ccttaaaaca 

tagatggaat 

gtgcatcaaa 

tgttggatca 

ctgggagcac 

atagagaaaa 

ctggaggaag 

taggatcaga 

tcaaaccaag 

atgtgccagg 

ccggacctga 

ttcccagttt 

cggaattatc 

agggggccaa 

atgtattgtg 

gtcacagatg 

ggtttaatga 

actatgaaaa 

taaatgcttt 

taccatactt 

ttacacaagc 



ggctgcgccc 
ctgtggtggc 
ggcactgaaa 
caagaagacc 
agctgaagaa 
tcctattgca 
tatgctgaaa 

gggagctcta 

agatggtgta 
gaggaagcag 
ctcaggtggg 
tacaggagga 
ctatggctta 
aatcttaacc 
ccccagggac 
ggcagatgtg 
aagtgaagta 
agtaattgaa 
cacatcagaa 
tgacattgat 
tgtggtgaga 
ttattttgtc 
taactctgga 
ggagttcatc 
tgtaaatatg 



atgtaacatc 
aggctgggca 
caaggccaaa 
aagtttctaa 
tcagaaaaga 
gtaaaagaca 
ggttatatac 
ctaatgggaa 
tttggaccag 
aatccccaca 
agtgcagctg 
tcgaccagaa 
gtttcccgfcc 
agatgtgtgg 
tctaccacag 
agcaaactat 
cagtctcttt 
gtatcccttc 
gtggcatcga 
gtgtccactg 
ggaagaattc 
aaagcacaga 
gtagatgtct 
aaagaggaca 
gcaggattgc 



actagcgacc 
cgaggaccat 
ttacaccaac 
atgcctacat 
gatataagaa 
atttcagcac 
caccttataa 
aaacaaattt 
ttaaaaaccc 
gcgagaatga 
ctgtatcggc 
atcctgctgc 
atggtctcat 
atgatgcagc 
tacatgaacc 
gtataggaat 
ggtccaaagc 
ctcacaccag 
atatggcaag 
aagccatgta 
tctcaggaaa 
aagtgagacg 
tgctaactcc 
acagaacccg 
cagcagtgag 



ggtgacctct 
gctgggccgg 
agagctctgt 
tactgtgtca 
tggacagtca 
ttctggcatt 
tgctacagta 
agatgagttt 
ctggagttat 
agattcagac 
gttcacatgc 
ccactgtggg 
tcccctggtg 
aattgtgttg 
tattaataaa 
tccaaaggaa 
tgctgacctc 
ttattcaatt 
atttgatggg 
tgctgcaacc 
ctttttctta 
ccfccattgct 
caccaccttg 
aagtgoccag 
tatccctgtt 



ttttccocct 
agcctccgag 
caaaaatgtc 
gaagaggtgg 
cttggggatt 
gagacaacat 
gttcagaagt 
gctatgggat 
tcaaaacaat 
tggctgataa 
tacgcggctt 
cttgttggtt 
aattcgatgg 
ggtgcactgg 
ccattcatgc 
tatcttgtac 
tttgagtctg 
gtctgctacc 
ctacaatatg 
agacgagaag 
ttaaaagaaa 
aatgactttg 
agtgaggcag 
gatgatattt 
gcactctcaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 



5/228 



BNSOOCID: <WO 0 1 62927A2 J_> 



WO 01/62927 



accaagggtt gccJB^ga ctgcagttta ttggacgtgc gtt^Mgac cagcagcttc 
ttacagtagc caaacfgttt gaaaaacaag tacagtttcc tgttartcaa cttcaagaac 
tcatggatga ttgttcagca gtccttgaaa atgaaaagtt agcctctgtc tctctaaaac 
agtaaacata tcttacaaat taaaatgact tttaggctgg gattacaggc gttagcancc 
atgcccggct taaaactggc tttttaaatg gaacttttgt ttattttatt ttatttcttc 
cttccctttt tttacaattt gaatggaact tttagctttg ccgtgctgcg ttaggaaatg 



PCT/US01/06059 



aagacacatc ctgaactgga , acaagacttg 
taacttcaca, atccaaacca cacgatgtta 
gcagggctta aatgtactct tttccagttt 
gtcgctttgg tcgtgcattg aatgttctct 
tatagtaatt cagagggggt gaactcttgc 
gcattgttag ttttagggca gatagccttt 
caattcaact gaccatttga ccttaaaagt 
ctaccgtaaa tcaggcgtat tgcttgtca 

<210> 9 
<211> 930 
<212> DNA 

<213> Homo sapiens 



ccaggggtga actggcaaac tttcctgtac 
tttcttatag tggctgtaaa ggaagaagct 
ctggatttat ctgtagttta cttttatttt 
tgactttcaa agatttgaaa atatatttaa 
atggtttgat taaaattata gccttaaaaa 
ggattcaagt atatcttgtc tttcatgtat 
tggtattttt aattccgaat tatttgtgtg 2280 

2309 



1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 



<220> 

<221> mis cofeature 
<223> Incyte ID No: 



LG: 1094571 . 1 : 2000MAY19 



<400> 9' 

cgcacgccgg cggcgc'ggag gccggctctg cgcttcgggc cgccccctcc ccccaccccg 60 
ctcacactcg gcacttactt cggctgtctc cgctgccctc cagcggagac gcagctcctc 120 
aggcgcccgg cggtatttgt tgggtcggcg gcgtcaggga ttcgcagtgg cctgtggtcg 180 
gcgtcgtccg gccactggtg cgcccccgcg gcaggcagag ctcacgctcc tgtcccccgg 240 
ctggtccggg gtctgggcgc cgcgtcgacg gcggctccgc aggacgcgca gaccgggccg 300 
cagcccatgc cccgagcgga ctgcattatg aggcacctgc cttacttctg ccggggtcaa 360 
gtggtgcggg gcttcggccg cggctccaag cagctgggca tccccacagc taattttcct 420 
gagcaagtgg tagataatct tccagctgat atatccactg gtatttacta tggttgggcc 480 
agtgttggaa gtggagatgt ccataagatg gtggtgagca taggatggaa cccatattac 540 
aagaatacga agaagtctat ggaaacacat atcatgcata ccttcaaaga ggacttctat* 600 
ggggaaatcc tcaatgtggc cattgttggc tacctgagac cagaaaagaa ctttgattct' 660 
ttagagtcac ttatttcagc aattcaaggt gatattgaag aagctaagaa acgactagag 720 
ttaccagaac atttgaaaat caaagaagac aatttcttcc aggtttctaa aagcaaaata 780 
atgaatggcc actgatgaaa aattgtatta tttattcatt cactgttttc tagtgtttct 840 
gtgtttatta ctgttcagca tcatcttggt tatatactga aaatcaaaca ctttactaca 900 
gttgtgaatt agtttaaacc gtacaatgtt 930 

<210> 10 
<211> 2700 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<223> Incyte ID No: 



LI:350754.4:2000MAY01 



<400> 10 

gcactgtccg tgccattccc agaggagcct gagaagaggc agaggaaggg cgaaacatgg 60 
ctgctctagg agtccagagt atcaactggc agaaggcctt caaccgacaa gcgcatcaca 120 
cagacaagtt ctccagccag gagctcatct tgcggagagg ccaaaacttc caggtcttaa 180 
tgatcatgaa caaaggcctt ggctctaacg aaagactgga gttcattgac accacagggc 240 
cttacccctc agagtcggcc atgacgaagg ctgtgtttcc actctccaat ggcagtagtg 300 
gtggctggag tgcggtgctt caggccagca atggcaatac tctgactatc agcatctcca 360 
gtcctgccag cgcacccata ggacggtaca caatggccct ccagatcttc tcccagggcg 420 
gcatctcctc tgtgaaactt gggacgttca tactgctttt taacccctgg ctgaatgtgg 480 
atagcgtctt tatgggtaac catgctgaga gagaagagta tgttcaggaa gatgccggca 540 
tcatctttgt gggaagcaca aaccgaattg gcatgattgg ctggaacttt ggacagtttg 600 
aagaagacat tctcagcatc tgcctctcaa tcttggatag gagtctgaat ttccgccgtg 660 
acgctgctac tgatgtggcc agcagaaatg accccaaata cgttggccgg gtgctgagtg 720 
ccatgatcaa tagcaatgat gacaatggtg tgcttgctgg gaattggagc ggcacttaca 780 
ccggtggccg ggacccaagg agctgggacg gcagcgtgga gatcctcaaa aattggaaaa 840 
aatctggctt cagcccagtc cgatatggcc agtgctgggt ctttgctggg accctcaaca 900 



6/228 



BNSOOCID: <WO 0162927A2J_> 



WO 01/62927 



1 c 



cagcgctgcg gtctttgggg^^tccttccc 

acacagaccg aaatctcagt gtggatgtgt 
agggtagtga tagcgtatgg aatttccatg 
acctgggccc cccgtacggt ggatggcagg 
aaggggtgtt ccagtgcggc cccgcttpgg 
tgaacttcga catgcccttt atcttcgcgg 
acgacaacac ' cactggcaaa -cagtggaaga 
acatcagcac caaggcggtg ggcagcaatg 
acccagaagg ctctgaccag gaaagacaag 
ccaacacgcc atttgccgcg acgtcttcga 
gcatcatcgg gaagctgaag gtcgctggca 
tcctactgct caaaaacctg agcagggata 
ggaccatcat ctacaacggc acgcttgtac 
ccctggaccc tgaggaagag gcagaacatc 
ggtacctgaa gtcagacaac atgatccgga 
ctgaggtggt ggtggagcgg gacatcatcc 
tgaacgaggc tcgtgtgcgg aagcctgtga 
atgagccggt gagggactgc gtgctgatgg 
tgaagatcga cgtgccgacc ctagggccca 
tgccctcccg gagtggcacc aagcaactgc 
caatcaaggc catgttgtcc atcgacgtag 
caaacttgga caacacggag cagggagagc 
tgtccggcct gggtaaaccc tctccatctc 
ccagcacatc cccctctcct ctcccccagg 
tgatcacttt tgcacattgc ctggccgctt 
acagccctgc tcattcctca cgcccttcaa 
ggactctcca aacgggatac aggagagaag 
tgcacggggc attcctgctt ctcatctcag 
acctgcccca gcactcacac cctaactcaa 
caccgactct acctgactgc tgttgcacct 

<210> 11 
<211> 534 
<212> UNA 

<213> Homo sapiens 



gggtgatcac 
actacgaccc 
tctggaatga 
tgttggatgc 
tcattggtgt 
aggttaatgc 
attccgtgaa 
ctcgcatgga 
tgttccaaaa 
tgggtttgga 
tgctggcagt 
cgaagacagt 
atgaagtgtg 
ccataaagat 
tcacagcggt 
tggacaaccc 
acgtgcagat 
tggagggaag 
aggagcggtc 
tcgccgactt 
ccgaatgaag 
tcaccatggg 
ccaaggctgc 
ttgggtctgg 
ctccccagag 
tgctgcagga 
ctggtctaga 
gccaccacag 
aataaatgtt 
aaggtggtga 



m 

c 



PCT/US01/06059 



caacttcaad^Pfagctcatg 9 60 

catgggaaac cccctggaca 1020 
aggctggttt gtgaggtctg 1080 
taccccgcag gaaagaagcc 1140 
tcgagagggt gatgtgcagc 1200 
cgaccgcatc acctggctgt 1260 
cagtcacacc attggcaggt 1320 
cgtcacggac aagtacaagt 1380 
ggctttgggg aaacttaaac 1440 
aacagaggaa caggagccca 1500 
aggcaaagaa gtcaacctgg 1560 
gacagtgaac atgacagcct 1620 
gaaggactct gccacaatgt 1680 
ctcgtacgct cagtatgaga 1740 
gtgcaaggtc ccagatgagt 1800 
caccttgacc ctggaggtgc I860 
gctcttctcc aatccactgg 1920 
cggcctgctg ttgggtaacc 1980 
ccgggtccgt tttgatatcc 2040 
ctcctgcaac aagttccctg 2100 
ggcgctggtg gcctcccgta 2160 
aatgaacccc ccgcccatgc 2220 
cagacatgga cctccaggct 2280 
gtccaccctg tcctatgact 2340 
ctgcctgctc tgtgagcccc 2400 
tggactggct cctgacccag 2460 
ctgtttgctg atccccaacc 2520 
agggcagggg atggttagtc 2580 
aaataagtgc gatcacacag 2640 
aggagagagg aggctgtggc 2700 



<220> h ; 

<221> mis cofeature 
<223> Incyte ID No: 



LI : 255828 . 29 : 2000MAY01 



<400> 11 

ctgtaccatt 

aatgaatgtg 

caagaatgtg 

tgctgtggct 

cactgcagag 

catggaaaag 

ggaagtgagc 

aggctgtgcg 

agctgtttga 



ggcccagctt 
gctcgtctga 
cgcacagcca 
ctagacacta 
gtggagctga 
tgtgacgaga 
cgactgcatc 
catgcagcac 
agaacttgtg 



cccgatcagt 
acttctctca 
cggaaagctt 
aaggacctga 
agaaggcagc 
acatcctgtg 
atgctgtctg 
ctgatagctc 
cgagccttaa 



ggagacgttg 
tggaactcat 
tgcttctgac 
gatccgaact 
cactctcaaa 
gctggactac 
gagaaacagc 
gtgaggctga 
gtcactccac 



aaggagatga 
gagtaccatg 
cccatcctct 
gggctcatca 
atcacgctgg 
aagaacatct 
caaaggggac 
ggcagccatg 
agacctcatg 



ttaagtctgg 60 
cggagaccat 120 
accggcccgt 180 
agggcagcgg 240 
ataacgccta 300 
gcaaggtggt 3 60 
tatcctctgg 420 
ttccaccgca 480 
gaag 534 



<210> 12 
<211> 2195 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 1190263 . 1 : 2000MAY01 



<400> 12 

ccacgctgtc 

atccttggac 

cctcccgtgg 

gcttgcacaa 

acttctttga 

gtcctaaatc 



gggacattaa 
gcagttggcc 
ctttaggttt 
tggagtcgcc 
acactatggc 
catggttgat 



tttggtgccc 
gaacccggag 
ctcctccatg 
caggagaccc 
cagtgcacac 
gacctggggt 



tacactgtgg 
cttagttaac 
tggaacctga 
ttggtctttt 
atgggtgcat 
atgggagatc 



ggcctcagaa 
ccgcagggcc 
acaacaagaa 
catgttcttt 
tgacgacaag 
tgggctgcta 



aagactcaga 60 

at t tggaagc 120 

ggtattccaa 180 

ggtgtgtggc 240 

gcctaatatt 3 00 

cggcaatg;ac 360 



7/228 



BNSDOCID: <WO 0162927A2J_> 



WO 01/62927 



accatgcagg acgc^^ftca 
cacatctctg ccggcWcct 
ataccccatc cggatcacgg 
atcttgcagt ccccgcgagg 
gaagcaagga tacagcacgg 
tcccagaagt gagccatgtg 
taatgtgcat gcctgttcac 
acaacggaat tagcctttga 
atcctcaccc taacctttgg 
cttctccatg attctgttta 
ccctttatac tgggactgcc 
ggctgaacga gctggatcca 
taaggaaact ttccttctct 
ggacgatttc actggcacca 
ctccatggtg ggcaagattc 
tgtctacttt acatcagatc 



tggtggatgg 
ccgcgtccca 
acctacaagt 
tcaggacagg 
actcggagca 
tccccaagga 
caccagcttc 
acctaccaaa 
ccttgacacg 
tgaaggaaca 
agggcaggag 
gaagaggaag 
cagaggaagc 



aatggaatat 
ggaattgtcc 
ttaatggata 
gtcattgacg 
tgaatttctt 
cgacagtggg 
tggtggctgc 
acccccttct 
tgccacagag 
cccaggaaac 
gtggttgaag 
tctctaagcc 
ctggggtcct 



actaactttg gtgctttcaa 
ccattccaga ttattaaagg 

<210> 13 

<211> 435 

<212> DNA 

<213> Homo sapiens 



ttcgaccgcc 
ctgcagccca 
taataggtta 
cctccctctt 
ggcttatagg 
ccacccatcc 
ctctctgttg 
gagtcagctc 
gaagctgagc 
ttttcctctt 
tcctcatgcg 
ttatggtgaa 
ttttctcctt 
gcaagcatgg 
ttgatgctat 
acggagggca 
acaaaggtgg 
gatggcctgg 
ttttaccaac 
gccgagacct 
ttccactact 
tcagtttgga 
tatgtcacct 
tgcttcttcg 
cccctctatg 
catcgtgctt 
ctttggtggt 
tcggggtcct 
aacgagaaga 
gttggcaagg 
cccactggta 



ttgccaggga 
aagccggtcc 
ctagtgggta 
aatgagacaa 
caaatggcac 
atataattag 
'acatgctgct 
tggctctgtg 
ggctgggtct 
gggctatgct 
ggggcacgag 
ggaggcgatt 
tcttcacgtg 
cttgtatggg 
cgatgatttt 
tttggaagct 
aaaaggcatg 
aaaggtacca 
tgtcgcatca 
catgcccttg 
gtggctccta 
aggctcagta 
cattatgcag 
atctctccag 
atttggtgat 
gtgaccatac 
ggggtgtccc 
aaggagaaga 
gataattaca 
agtgcattta 
gttcc 



agg^^Bcag 

cgggWetgt 

atagaagtgt 
cacttgcagc 
■caaaggtttg 
tggcgttttg 
ggccgggacc 
tgcagctagt 
ctgttccctg 
tggttctcca 
atcacggagc 
tcctttttag 
cacacacctc 
gataatgtgg 
ggcctaagga 
aggcgagggc 

gggggctggg 

gctggacggt 
gtgtcaggag 
cttgcagggc 
cctgcacgcc 
tgtgacccgg 
atgttccgga 
gaacccctca 
taaaaaggtg 
caactttcag 
cattgggtct 
gatatttaca 
atcaggctac 
atagtcaata 



PCT7US01/06059 



actgactcag 
gagggagaag 
gtttccaaat 
cttgttaaaa 
cactggcgat 
actaactaac 
ccctctcgtt 
tgccattgcc 
ggctcctgat 
gccacacgtc 
agcccatgaa 
aaaggcacag 
tccccaccac 
acgagatgga 
acaacaccct 
atgcccaact 
aaggtggaat 
tgattaagga 
gaagtctccc 
aacgtcaggc 
gtgcggtgga 
gtattcaagc 
gaacagggtt 
gagtccacac 
gccaacgccc 
aaattgaatc 
gtgtgacagg 
atcaggctac 
caaaggaagc 
aattcatcta 



<220> 

<221> misc__f eature 
<223> Incyte ID No: 



LG: 27 09 16 . 2 : 2000FEB18 



420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

i020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2195 



<400> 13 

gaagttttcc 

aatgatctcc 

gtgcccccca 

aggcgtcctt 

aactgcattg 

tgctgagaca 

ccactgttga 

ttcttccaca 



tccaacggct 
agatgctctc 
cccagaatgc 
gtggcagtga 

tgggagtttt 

ggtgactagg 
cggacggggg 
aacag 



tatggccctc 
cgatgcacct 
ccttccagaa 
tttggggaac 
gaaactgtgg 
gtgcactgga 
aaaagccaga 



tacgtggctt 
gctcaccatc 
gtgcttgctg 
cactgaggca 
agtcctggtc 
agaggttagc 
accgaccgct 



ctcactacaa 
tcttctgcct 
ttatccaggt 
tcaggaatta 
tggaaccaag 
gccactagac 
ctcttatttg 



gaactctccc 60 
tctgcctcct 120 
ataggagcag 180 
gtggcttaat 240 
gggctgggtc 3 00 
acccaaagct 3 60 
actcacagct 420 
435 



<210> 14 
<211> 1574 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_ feature 
<223> Incyte ID No: 



LG : 999414. 3 :2000FEB18 



<400> 14 

tagattcaat 

gaaatgatgt 

aggtcttccg 

cttgtgtcgt 

agtagagtgt 

agagagaagg 



tcatttggaa 
cagaaaaccg 
attgagaact 
ccttttacat 
atttcatggc 
acactgtcat 



acacagctcc 
aaggatgatg 
ataaggtttg 
attctacatc 
ttcttaattt 
tgcattgtat 



catttttgag 
ataaatacta 
atttcatttt 
aatggaagtt 
ctttcaatta 
ttttttctga 



tgtgcatgta 
aaagtaacag 
cagataatgg 
gtagcaccgt 
gactgagatc 
aaagagtagg 



cttttatgaa 60 
gcatgtgaaa 120 
ggtcctagct 180 
gtcagaataa 240 
tttttcttca 300 
ccgtatttta 3 60 



8/228 



BNSDOCID: <WO 01 62927A2_I_> 



WO 01/62927 



ctgagatcac 
tctctaaagt 
tgagattttt 
tcaagatgtt 
ttttccaatc 
ggtgtctgag 
gctttgctgt 
atcctggctg 
ctgtgctggc 
cagtcgcata 
cttgttgcat 
atctgtcgcc 
ttggtgtcct 
attaaccatc 
caaccttgcc 
atttggtttt 
tctaagaatt 
ggcagttgtt 
accagccccc 
gaaccctttc 
agcagaacag 



ggagttgttcf 
agtatgaaca 
ggaattgtca 
ccttgtgaat 
ctctttcctt 
aattcctcct 
ccctgtccct 
tcagctctga 
ctgacatcgg 
gcagagtcat 
gtatagaagg 
ctctgcacta 
tttttcctta 
atcaagaatg 
tgttctgaca 
cttcccatct 
tcatcatcag 
tgcttatttt 
aggagtggtg 
atctactgcc 
tcga 



gtgacgt 
gaaagccttg 
caactgagaa 
tgctcgcatc 
ctcaaggtgt 
cctgggactc 
gtccatgtat 
ctgccccctc 
tttcacctcg 
ctctcatgcg 
catgctcctg 
cccagtcatc 
gcctgttgga 
tggaaatctc 
gcgtcatcga 
cagggatcct 
atgggaagta 
atggaacagg 
tggtggtgtc 
tgagaaaaca 



tttggtcttc 
tatagcaaaa 
acattgctgg 
cagcattaag 
'cca'agacaca 
tcagaggatc 
ctggtcacgg 
cacaccccca 
gccatggttc 
ggctgcctga 
actgtgatgg 
gtgaatcctc 
ttcccagctg 
taatttggtc 
tagcatattc 
tttgtcttac 
taaagccttc 
cattggcgtg 
agtgatgtac 
gggacattca 



taatattct 
agtaaatcac 
.cggtgtatgg 
ggctggtttt 
cggagccacg 
cagaactgca 
tgctgaggaa 
tgtacttctt 
ccaagatgat 
cacagatgtc 
cctatgactg 
acctctgtgt 
cacagttgga 
tgtgacccat 
atatatttag 
tataaaattg 
tccacctgtc 
tacctgactt 
actgtggtca 
aagcgcctgt 



PCT/US01/06059 



gtggattt 
gtaataattc 
tccgcaagtg 
tatcttttat 
gaatctcaca 
gccggccctc 
cctgttcagc 
cctctccaac 
tgtggacacg 
tttcctgctc 
ctttgtagcc 
cttcttcgtt 
ttgtgttaca 
ctcaacttct 
atagtactat 
tcccctccat 
gctctcacct 
cagctgtggc 
cccccatgct 
ggaggtgcgc 



42 0 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1574 



<210> 15 
<211> 849 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: LG : 429446 . 1 : 2000FEB18 



<400> 15 

caggtcagag 

tctaagaaca 

gtttgtctct 

gatgcagtct 

cgatgttgat 

acgccccaat 

tgagttttcc 

ctaagaatcc 

caacttagct 

gcaaaaacag 

acaatgatca 

cctccgtttg 

ttctgatcca 

ttgccaccat 

ctgtgctca 



aacaaaaggc 
aagacccacc 
tccagaatca 
atgactaaga 
gacatcgaag 
tccgcaggaa 
tgttgagagg 
ctaagcctat 
cacacccgac 
gaggtaaaga 
ggatataaac 
tatgggagct 
tgtttgttac 
cacagacccg 



ttgctgccat 
cgtaacattt 
aaactgtaaa 
tctaccacag 
gcgcccctcc 
gcagttagag 
gagaactgag 
ctgggaaggt 
taatcagata 
aatagccaat 
ccaggcattc 
ctgttttcat 
agcttgagct 
ccactgactt 



cttgggagtg 
ggtggcaact 
actacaaatt 
acccttggac 
caaggaaatc 
cagtcattgg 
agacaggact 
gaccgtatcc 
gtaaggagag 
catctattgc 
aagccagcaa 
gctattttaa 
gagcttttgc 
ccatccctct 



gccgccacca 
gtacggggat 
gttcttcaaa 
cggcctgcta 
tcaactgcat 
ccaacctccc 
agctggattt 
acctttaaac 
ctcactaaaa 
ctgagagcac 
tggctaccct 
agtcttgcaa 
tcaccatcca 
ggatccagca 



tcttgggagc 60 
tctccaaagt 120 
tggagcccca 180 
gcccatgctc 240 
gacccctact 3 00 
caacagcac t 3 60 
cccaggccaa 420 
atagggcttg 480 
tgctaattag 540 
agcaggaggg 600 
ctttgggtcc 660 
ctgcacactc 720 
gcaatgctct 780 
gggtgttcag 840 
849 



<210> 16 

<211> 443 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 
<223> Incyte ID No: 



LI : 057229 .1:200 0FEB01 



<400> 16 

aaactatcct 

cacatatata 

ttgcttgaat 

ttgagacaga 

tgcaacctct 

attatagaca 

cgccatgtta 

acttactaag 



ggattgtata 
taaaataaca 
ctaaaattgc 
gtcttgctct 
gccttctggg 
tgcgccaccg 
gccaggctgg 
attattgtat 



tatatgtata 
acccagtcta 
aagacaattt 
gtttcccagg 
atcaagtgat 
tgcccagctg 
tctggaagca 
tec 



tattgtgtat 
cctgaatttt 
attttatttt 
ttggagtgcg 
tctcctgcct 
atttttgtat 
ttttaaaacc 



atatatgtat 
cacattttca 
attttatttt 
gtggtgcggt 
cagcctcctg 
ttttagtaga 
aaaaggtttg 



atattatata 60 
gaaaactgta 120 
acatattttt 180 
cttggctcac 240 
agtagctggg 300 
gatggggttt 3€0 
tatttaactt 420 
443 
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<210> 17 

<211> 1393 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 3519 65 . 1 :200 0FEB01 
<220> 

<221> unsure 
<222> 120, 765 

<223> a, t, c, g, or other t 

• . i 

<400> 17 

gaagctgtaa aactactaat tgttcttcaa aaggagcccc agat&cagtc catgactaag 60 
atctaccaca gacccctgga ccggcctgct agcccatgca ccgatgttaa tgatatcgan 120 
agcactcccc caaggaaatt tcaactgcac aacccctact acaccccaat tcagcaggaa .180 
gcagttagag cggtcgtcag ccaacctccc cagcagcact tgggttttcc tgttgagagc 240 
ggggactgag agacaggact agctggattt cctaggctga ctaagaatcc ctaagcctag 300 
ctgggaaggt gaccgcgtct acctttaaac acggggcttg caacttagct cacacccaac 360 
caatcaggta gtaaagagag ctcactaaaa tgctaattag gcaaaaacag gaggtagaga 420 
aatagccaat catctatcgc ctgagagcac agcaggaggg acaatgatcc ggatataaac 480 
ccaagcattc gagccagcaa cggctgcccc ctttgtgtcc cctccctttg tatgggagct 540 
ctgttttcac tctattaaat cttgcaactg cactcttctg gtctgtgttt gttacggttt 600 
gagctgagct ttcgctbgcc gtccaccact gctgtttgcc gccatcggag acctgccgct 660 
gacttccatc cctccggatc tggcagggtg ttcatgtgct cctgatccag agaggcaccc 720 
attgccattc ctgattgggc taaaggcttg ccattgttcc tgcangacta agtgcccggg 780 
ttcatcctaa tcgagctgaa cactagtcgc tgggttccac gattctcttc cgtgacccac 840 
gacttctaat agagctataa cactcaccgc acggcccaag attccattcc ttcgaatccg 900 
tgaggccaag aaccccaggt cagagaacat gaggcttgcc accattttgg aagtggcctg 960 
ccgccatttt ggaagtggcc tgccaccatc ttgggagctc tgggagcaag gaccccctgg 1020 
taacagtagt agatacatag aatttttaaa taaacatact tatattaagg gtactcttag 1080 
ggtctatact aagaaatttt gcatgcttgt cattaaaagc tttgagtcca agtcctgtgt 1140 
gtgtgtatat gactttgact ctaagtcaag tgtaaatgta actgtataga tatgccatgt 1200 
atataatgta aataaaatat atacatatat ttataaataa ggcagtctga aaacttgctt 1260 
gctggtgaac tttaaaacag cctctttctc gtgatatacc cctccagttt atccagtgga 1320 
aaagaaaatt actgctggat taaaaacaag tgcttaggct aggtgtggta cctcatgcct 1380 
gtaatttcag cac H ' 1393 

<210> 18 

<211> 884 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 068682 . 1 :2000FEB18 
<400> 18 

acagcggcca atgtctggca gtgggcacat gggggtgcgg gggtgtaggt gccaagcgcc 60 
atggcttaga cccgagattg gagtccggcc gcccccccgc agccaagccg cctcctgccc 120 
cccgtgcgcc ctaggcgcca ccatgtcggg agacaaactt ctgagcgaac tcggttataa 180 
gctgggccgc acaattggag agggcagcta ctccaaggtg aaggtggcca catccaagaa 240 
gtacaagggt accgtggcca tcaaggtggt ggaccggcgg cgagcgcccc cggacttcgt 300 
caacaagttc ctgccgcgag agctgtccat cctgcggggc gtgcgacacc cgcacatcgt 360 
gcacgtcttc gagttcatcg aggtgtgcaa cgggaaactg tacatcgtga tggaagcggc 420 
cgccaccgac ctgctgcaag ccgtgcagcg caacgggcgc atccccggag ttcaggcgcg 480 
cgacctcttt gcgcagatcg ccggcgccgt gcgctacctg cacgatcatc acctggtgca 540 
ccgcgacctc aagtgcgaaa acgtgctgct gagcccggac gagcgccgcg tcaagctcac 600 
cgacttcggc ttcggccgcc aggcccatgg ctacccagac ctgagcacca cctactgcgg 660 
ctcagccgcc tacgcgtcac ccgaggtgct cctgggcatc ccctacgacc ccaagaagta 720 
cgatgtgtgg agcatgggcg tcgtgctcta cgtcatggtc accgggtgca tgcccttcga 780 
cgactcggac atcgccggcc tgccccggcg ccagaaacgc ggcgtgctct accccgaagg 840 
cctcgagctg tccgagcgct gcaaggccct gatcgccgag ctgc 884 

<210> 19 
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<211> 1116 

<212> DNA 

<213> Homo sapiens 

<220> 1 
<221> misc_feature 

<223> Incyte ID No: LG : 242665 . 1 : 2000FEB18 
<220> 

<221> unsure 

<222> 1087, 1090 

<223> a, t, c, g, or other 

<400> 19 

caccacctca tttcgcttta tttcacagac ttccccatct cattttttat gttctatgct 60 
aatttctcaa gaagaacagg cccggcacca cctctgcgca caaccccgcg ggcctggctc 120 
aggcgggagt gcggggccag caccatgagc gccccgggca gccccgacca ggcctatgac 180 
ttcctgctca agttcctgct ggtgggcgac agggacgtag gcaagagtga gatcctggag 240 
agcctgcagg atggtgcagc tgagtccccg tacagccatc tcggggggat cgactacaag 300 
acgaccacca tcctgctgga cggccagcgg gtgaagctga agctctggga tacgtcgggg 3 60 
cagggaagat tttgtaccat attccgctcc tactctcgtg gtgcacaagg agtgatcctg 420 
gtctacgaca ttgcaaaccg ctggtctttc gagggtatgg atcgatggat taagaagatt 480 
gaggaacatg cccctggtgt ccctaaaatc ctggtgggga atcgcctaca tctggcattc 540 
aagaggcagg tgcccaggga gcaggcccag gcctacgccg agcgcctggg tgtgaccttc 600 
tttgaggtca gccctctgtg caatttcaac <atcatagagt ctttcacgga gctggccagg 660 
atagtgctgc tgcggcacag gatgaattgg ctcgggaggc cgagcaaggt actgagcttg 720 
caagacctct gctgccgcac catcgtgtcc tgcacacctg tgcatctggt ggacaagctc 780 
ccgctcccca gtaccttaag aagccacctc aagtccttct ccatggctaa gggcctgaat 840 
gccaggatga tgcgaggcct ctcctactcc ctcaccacca gctccactca caaaaggagc 9 00 
agcctctgca aagtgaagat cgtctgccca ccccagagcc cacccaaaaa ctgcaccaga 960 
aacagctgca aaatttctta aggaaggcac caaaaggaaa caagctggaa tcgctccagg 1020 
aaaaactctg gttacacctg gaagatggaa gtcgcattgt agattcaaga aattagtttt 1080 
cagtttntcn cgggaaccat gctgcttggg aatgtg 1116 

<210> 20 
<211> 585 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: LG : 241743 . 1 : 2000FEB18 
<400> 20 

ccggcccaga atgtagaaac ggttaggagg gacctgtgaa aaagcgggac accaagaaga 60 
gcactgttgc tgtttttagt ttttaaaata agaggaagcc cagtgtccca tctgatgccg 120 
aggctgaagg agtctcgctc ccacgagtcc ctgctcagcc ccagcagtgc ggtggaggcg 180 
ctggacctca gcatggagga agaggtggtc atcaagcccg tgcacagcag catccttggc 240 
caggactact gcttcgaggt gacgacgtca tcaggaagca agtgcttttc ctgccggtct 300 
gcagctgagc gggataagtg gatggagaac ctccggcgag cggtgcatcc caacaaggac 360 
aacagccggc gtgtggagca catcctgaag ctgtgggtga tcgaggccaa ggacctgcca 420 
gccaagaaga agtacctgtg cgagctgtgc ctggacgatg tgctctatgc ccgcaccacg 480 
ggcaagctca agacggacaa tgttttctgg ggcgagcact tcgagttcca caacttgccg 540 
cctctgcgca cggtcactgt ccacctgtac cgggagaccg acaag 585 

<210> 21 

<211> 2950 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LI : 034212 - 1 : 2000FEB01 
<220> 

<221> unsure 
<222> 52, 132 
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<400> 21 

tactatattg ctcagcattt ctaagtattc tctaagtgct ctttatttat gntttaaaat 60 
agctctctta cccagctgcg aagactagaa gaacttgatt taggaaacaa tgaaatatat 120 
aatttgccag antcaattgg agccctctta catctaaaag atctctggtt ggatggaaat 180 
caactgtcag aattacctca ggaaatagga aatctgaaga acctgctgtg tttagatgtc 240 
tctgaaaaca ggttggaaag acttcctgaa gaaatcagtg gcctgacttc attaacggat 300 
ttagtcattt' cccagaactt attagaaacg attccggatg gcattggaaa actaaagaaa 360 
ctgtcaatct tgaaggtgga tcagaataga ctcacacagt tgcctgaagc agttggggaa 420 
tgtgaaagtc tcac'tgagtt agttcttaca gaaaatcagc tcctgaccct gcctaaaagc 480 
attggaaaac taaagaagtt gagcaacttg aatgcagaca gaaataaatt agtgtcctta 540 
ccaaaagaga tcggcgggtg ctgcagcctc actgtgttct gtgtacgtga caacagacta 600 
actcggatac ctgcagaggt gtcacaggca acagaacttc a.tgtcctgga tgtggcaggg 660 
aacaggttgc tgcatctacc tttatccctg actgccttga agttgaaggc tctgtggcta 720 
tctgacaacc agtcccagcc cctgcttaca ttccagacag acacagacta caccacagga 780 
gagaagattt taacctgtgt cttacttcct cagctgcctt ctgaacctac ttgtcaagag 840 
aatctgcctc gctgtggtgc actggagaac ttggtaaatg atgtctctga tgaagcctgg 900 
aacgagcgtg ctgtcaacag agtcagtgcg atccgatttg tggaggatga gaaagatgaa 960 
gaagacaatg agacgagaac acttctaagg cgagccactc cacacccagg ggagttaaag 1020 
cacatgaaaa agacagtgga gaatttacgg aatgacatga atgctgctaa aggactggac 1080 
tcaaacaaaa acgaggtcaa tcatgccatt gaccgagtga ccacttctgt gtagagtttc 1140 
acctccaagt tttacctcct gtgtcttcct ctgctgtcga gacgttcctg tctgcttccc 1200 
gggagcctca cgtgctcctt gtcctaacca gcccccgcgc gccatcttcc cgtggagtgt 1260 
ggggaagctg ctgtctccca ggaagtgcct tactcatccc gcaaccagtc agcgcaccag 1320 
tggtctcccg gtgtgatttt ttttttttta atttcagttg tttgtaataa gtagaataca i380 
ctactgtaaa catacgacct ttgtttttgt cttatgttgg ggtaaaggaa agcaggaagg 1440 
ggaattttta tcctcctccc ttccgtaaag tgctgggata ttttgaatcc cccaagttcc 1500 
cttggaccta ctgatgagag atagttttat gtatggggaa aaatggatac tttttaaacc 1560 
ttttttggca gctcagatgg tgtaaatttt aaaattttgt ataggtattt cataacaaaa 1620 
atatgtattt cttttttgtt attttatctt gaaaacggta catattttag tatttgtgca 1680 
gaaaaacaag tcctaaagta tttgttttta tttgtaccat ccacttgtgc cttactgtat 1740 
cctgtgtcat gtccaatcag ttgtaaacaa tggcatcttt gaacagtgtg atgagaatag 1800 
gaatgtggtg ttttaaagca gtgttgcatt ttaatcagta atctacctgg tggatttgtt 1860 
tttaaccaaa aagatgaatt atcaatgatt tgtaattata tcggttgatt ttttttgaaa 1920 
agatgaacca aaggatttga ctgctaatat tttattcctt acactttttt tctgaataag 1980 
tctctcataa tgagtgcagt gtcagactgt gcctactctg atggtatgtg ccatttgtaa 2040 
aataaaatag agcagaaaaa cacaaaaaga gaacactggt tcagacattc agtgggcaag 2100 
taaattatgg actgcaaaat aatgattttt attcaagaaa gctttaaaag ttttatatcc 2160 
agatatacaa ccacaataaa gcaaaataac ctactatcaa aatagaaatg ttgctatctt 2220 
tataagtgca atttaatttg taaatagagt ttgaatcaaa gtatcacaaa atactgcttc 2280 
aagatttaat tttaaatctg ctaatttaag ggatattggg aaaagttttg gtgtgtttct 2340 
gttgatttct tttttgtatg ctgtgataaa agagaaatga aaagtgccag tcactgtgtg 2400 
gtgtctagga aaatcatata tatttttttc tccaagaaat aaattcatcc tggacattgg 2460 
ccatacagct ttttaaaatt attactttgt atgttcaagt gatagcaggt agccaaattc 2520 
tttgacagtg tgctctggtc tgttaaatat ctaaattacc cgtcagttgt gagtgacctc 2580 
ctgtgggact tgcattcaca tggggcagag cccagaattg cctttgactc tggctagtaa 2640 
ttttgggttg tggctatctg gccaattgga ctccttataa acccgtcttc aaccaaccaa 2700 
ataccatact tttttttact tcctatacat ttacttattg ggttcactta gctttttagt 2760 
ctgtgtgtat gtaaatatgt gtgtatttta aaaactattt atggagctga aagtacataa 2820 
cacataagtg agtttcggct ggtcagtttt ataccttgtt ccctaatctg tgcataccaa 2880 
ttttcttttc tctgttttaa gatttttgac atcctctatt ttgttttaag cttataaaga 2940 
taatgccact 2950 

<210> 22 

<211> 1973 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 344886 . 1 :2000MAY19 
<400> 22 

gttctgccga gtggggcgcc gcggccgcta ttgtcccgcc ccctgctccg caagattcga 60 
gcctgagcgg cctgggcgtc tcgagaggtg agagagttgg cggcgaggtc tcggcggcta 120 
agcgagcgtc ggcgactgtc tctccgcgag aggaggcaag ttggggtcca ggctccaaag 180 
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ccggtggccg cgtaccgcgg^Pgagccgct 

agtctagctc tttcacccag gctggagtga 
cgccccctgg gttcaagtga ttctcctgcc 
atatcctgag agaagatggg aaagggctgc 
gggaaaactg caattttgga gcagctcctt 
tgcgaaacaa tggaagatgt ' atacatggct 
cagttacatc tttatgacac cagaggtcta 
ttttcatttg ctgat'ggctt cgttcttgtg 
ttctgaagaa agaaatcgat 
gaaacaaaat cgacctttct 
caaaaagtga gaaagtaaga 
aaccattcac tttattagcc 
tgcctgggag gaaaaacaaa 
caattgtatg ttgaatagtg 



agagtggagc 
gtggtattag 
cagcagtggg 
actctgattg 
agctttcctt 
gtaatttcca 
aatattacta 
ggaagtcaat 
atatgaaata 
tgttatttat 
ctatgtgtaa 
aacattatgc 
atgtctagga 
tggggtagta 
tttagctttg 
ggttcatcac 
cttgagggca 
gctgtatttt 



tttaaaatag gccatttgta 
taatgcactt tagatgttaa 
atatattctt attctcattg 
gttgtacttg tgaaaacagt 
cttcccccca accccaagct 
aaacttctaa gcccaaacat 



ttataaaaca 
agctaaattt 
ctcaaattag 
gctctaaggc 
aactgtttgc 
ctggaaggaa 



tttttttgtg 
catcttgtgg 
aatgtttagc 
cgttttttcc 
tcctcctctt 
gactgtatct 
atattgggct 
tttggcaaga 
aagctactta 
ttcaacaaag 



gtctttgagg 
agtggtgcaa 
tcagcctccc 
aaggttgtgg 
tatggaaatc 
tcagtagaaa 
caggaaggcg 
tacagtgtga 
aagttcaaag 
gagcagagac 
ctgtgggagg 
agtaaacttt 
gggaactcta 
attgccttta 
tctacctttg 
aagtatttgg 
tttgaaacct 
aaaatagttt 
gtttcggaag 
gactttgttt 
tctaaattgg 
agattttgct 
atgagttgag 
ttagccagac 
gaaaatgatt 
tctggaatag 
ctaatttgat 
ctcaagcaat 
gtggtttcac 
aacatattaa 
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t c t gaggag^^^Tacagacag 

tctcagctca gtgcaacctc 
cagtggctgg gattacagtg 
tttgtggatt gttatctgtg 
atactattgg aatggaagat 
cagaccgagg agtaaaagaa 
tggagctgcc aaagcattat 
ataaccttga atcctttcaa 
acaaaaaaga ggtagcaatt 
aagtggacgc tgaagtggca 
tgactgttac agatcggaaa 
ctcaacccca gagcaaatca 
attctgagaa ctaaaaatca 
agtgtctgtg aacatggagt 
gtccttagga aaattcctaa 



atacaataaa agtatcaaaa 
tgtgttgttt ctctaaagat 
taatttaaaa ggtaggaagt 
ctgaaatgta aaataagcta 

<210> 23 

<211> 683 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> mis cofeature 

<223> Incyte ID No: LG: 228930 . 1 ; 2000MAY19 



gctaaggtta ttattgcctg 
gtctttgaaa ttagcacctt 
ggatagttat gcaaatgcac 
atatcataat cattctgtgt 
ttaaaaagtt cattaatcta 
acccaaaaca ttgaacagtt 
aaacagacta agacccatga 
gtaccaggta gtgttaagta 
ccctgttgat agaccagata 
aggcacttaa ggacagtaaa 1680 
ttttctagaa aactagtcat 1740 
ctgacttaga tgtctgagtt 1800 
gtggctgact gtaactttat 1860 
ctgtgaaata actattttga 1920 
aacatcaagt aat 1973 



240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 



<400> 23 

gccggagccc 

caagcccgtg 

tgactcaact 

ctggacagat 

atgtgggaga 

gaacatcggg 

catctcccat 

ggttttcagt 

aaaagcctat 

catatatcgc 

tcttccatct 

aaaggctccc 



agatgagccc 
tgcgtgtccg 
attgttactc 
caaaacaagg 
cacgccaaag 
cgcctggagc 
ttgaatctcg 
ttggcaacaa 
actaaactta 
ttgttttcag 
tcaagggtgg 
ccaaaacatt 



agatggccgg 
acagcctcaa 
caattatttt 
agacagagct 
ctcccaagga 
agcgcatgat 
tggctttcca 
acctgctggc 
agctgcaagt 
cagatcggaa 
aaaaagctaa 
ttc 



ggctcaaccc 
gaagggcacc 
gaggactgac 
actggatctc 
ccccaaatta 
cacagtggtg 
agaagaagtg 
ccaaaacatg 
cactccagaa 
gcgagttgaa 
tgaggctgcg 



ggagtgcacg 
aaattcgtca 
cctcagggat 
agccttgtca 
cgtgaacttt 
tatgggcctg 
gccaaggaat 
tccagggatg 
gggcgtattc 
actgctttag 
aaaagtgagc 



ccttgcaact 60 
agtgggatga 120 
ttttctttta 180 
aagatgccag 240 
tggatgtggg 3 00 
acctcgtgaa 3 60 
ggacaaatga 420 
catttctgga 480 
ctctcaaaaa 540 
aggc 1 1 gtag 600 
agagctgtgg 660 
683 



<210> 24 

<211> 1435 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LG: 33 8927 . 1 : 2000MAY19 



<220> 

<221> unsure 
<222> 1409, 1432 
<223> a, t, c, g, 



or other 
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# 

ctTJag 



<400> 24 

gtgagccgac cttctgagcc 
atctgtccca ggccagcgtg 
ggtagctgag atccaccccg 
catctgtgtc caacctggtc 
gaggccagag gctagaggac 
gaccacggga gaagacgaat 
ggaggtacct gaactccctg 
agcctgtgac cctctcagga 
tgctggagac agagcaggcc 
aggagctgct gaagacagcc 
tcttctccaa catctcctcc 
agcggcgcct ggacgactgg 
ctggccaggt ggaggtgggg 
cttattgtgt ggccatgcct 
gaatgagctc tgagaggctg 
gtcctggcca agccaagtga 
gagggcttcc tacacgcctg 
tcagcctgct gtgtgatgac 
tggccactaa agcatttaac 
ggatgtcctt taagatcttt 
attgattgct taaaacattt 
catacctggc gcatgcatat 
ttacctggga tgaagaataa 
agtgccagtc atgatagtag 

<210> 25 

<211> 1508 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: IiG: 898771 . 1 : 2000MAY19 



ctcaagaaag 
gctgagtgtg 
gaaaccggca 
actgtgtttg 
gtgcatcacc 
gtcggggagg 
aagaacaagc 
tcggggacgc 
tatgtggcgc 
cgcagcagca 
atctatcagt 
tgaggtccac 
agaccccagg 
ccctggggct 
tgtgtgtgca 
ccaagtgact 
gccttgggct, 
ggcatcatgg 
agtgaagtta 
ggcaaagaag 
ataataaact 
tttttaagag 
agagagcaaa 
taataatant 



atcagaacag 
ctggctggag 
ggatgaaggg 
agaatagcag 
gccctgagtg 
ccgtggggtc 
tgtccagcga 
aggagccaga 
gcctccacct 
aggccttccc 
tccattctca 
caggagcccc 
gcgctaggtt 
cggtctgtct 
gactgcttcc 
ttgggacagt 
ttccctctat 
ttatgcaaga 
cttgccttat 
aatgctagta 
gttgtttttt 
tacccactgg 
ctaccacaag 
aataaacatc 
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attc^Tgggt 
gcctctctct 
ggcaagtgag 
gaccccagaa 
caggcctccc 
tgagcccagg 
agcctggagg 
gaagaagatc 
gctagaccag 
agaggatgtg 
gttcttcctc 
tgaggcctca 
cagcctccta 
gcacagtgaa 
tggggaaagg 
gaaaatagcc 
taactccctc 
aaccctcttg 
atagtttgcc 
attgttaaac 
tttaagataa 
agtctaatat 
caatggttga 
tattgggcca 



gatttagcct 
ctgcttcgag 
gagaagctgg 
gcagcaccca 
gagtccccag 
acagtcagca. 
aaatcttgcc 
gtccaggagc 
gtgtttttcc 
gtcagggtca 
ccagagctgc 
gaccacagcc 
ggcccaggcc 
gggcttgggt 
ggtggatcag 
agtgccaact 
aatcttcaca 
ttttcacaga 
acgaagtcta 
ctagtgatgt 
taatagccac 
atagcctggg 
gcccctgtca 
anaaa 



<400> 25 

ggtgacccag 

ctggcatgtc 

ggctgcccgg 

actcagaatc 

tataaaggaa 

tgcagcctcg 

actgcagtgt 

gctttgtgca 

cgtcggagca 

cgttctccta 

ccctgagcgg 

aaggctggca 

tggaactcac 

agctgcccaa 

cgctctatca 

cccctctgaa 

ccccgacggg 

cctccagtta 

cagcaaaggc 

gagcccacct 

cgggcggcac 

ggaatggagc 

atccccgttg 

acgctccttg 

caggatggcg 

aattgatt 



cgacctgtgt 
ctgggtggtc 
ggtgggtcag 
ctccagctct 
gcaaaatcag 
acctcccagg 
ccccaactcc 
gaacgtgacc 
gcaggcagac 
cgcgcagctg 
gatctacgcc 
gctgaagccg 
aggcatattg 
gcagtgtacc 
caaagaagag 
gatccacatc 
caccatcagt 
ccgctttgtg 
cgcgtcggag 
ggcgccgtgg 
agacagccca 
cgcagctggc 
gtcatttgct 
ttcatctgcc 
ttctccttat 



gtcccacagc 
ccagtggcac 
catggggttg 
ctcgcctgag 
ggcctagggg 
ctcaagccat 
tgcccagcca 
tcggacctgc 
acgtctggag 
atcgtgcagg 
cacatcacca 
agtccccagg 
cttcccttaa 
ttccggtttc 
gccccagcct 
ccggagccgg 
gtccccaact 
cagaacgtga 
cagcaggcag 
tgcacctggt 
gggcctgcgg 
ggtagagccg 
ggtgacaggg 
gctggcagcc 
aactttgtct 



ccgtccgtcc 
cttctaggcc 
tccctgtgag 
gcggaaatgt 
ctagagtgca 
cttcctgcct 
gtccacgcgg 
agctggcagc 
gagacagccc 
ccatctcctc 
agcattaccc 
gccggatcgc 
aacagaagag 
ccagcacggc 
ccccgctgcg 
acctccggag 
cctgcccagc 
cctcggacct 
acacgtctgg 
gacccaggat 
gcgtgtgctc 
tcggcccttt 
gtggcgccgt 
cggcgtcaaa 
tagttcattt 



ctctgtcagt 
cggcccctgg 
cacagctccg 
gacttgggaa 
gtggtgtgat 
cagcctcctg 
tgccggctcc 
agagtttgca 
caaggatgag 
cgcccaggac 
ctactaccgg 
ctctgaaggc 
acgcggcgcg 
catcaagatc 
gccactgtac 
catggtcagc 
cagtccacgc 
gcagctggca 
aggagacagc 
cgggccatag 
gaggtggtct 
ctcgttttcg 
caccgagggc 
gccttcatac 
ttgtccaata 



ctccctgtca 
atgcggtgga 
ggagacccgt 
ttgtcctcca 
catagctcac 
aacagctgga 
tccagttacc 
gcaaaggccg 
tcaaagccgc 
cggcagctga 
acggccgaca 
ccttagaaca 
cccgccctgc 
cagttcactt 
ccccagatct 
cccgtcccct 
ggtgccggct 
gcagagtttg 
cccaaggtct 
tgacccggcg 
cagcccacag 
tgcccaaatc 
ccctcccgcc 
agcccttgag 
gctgttttca 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1435 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1508 



<210> 26 

<211> 574 

<212> DNA 

<213> Homo sapiens 



14/228 



BNSDOCID: <WO 0162927A2_I_> 



* WO 01/62927 



PCT/US01/06059 



<220> 

<221> misc_feature 
<223> Incyte ID No: 



LI :257664 . 67 :2000MAY01 



<400> 26 

cgacctaaga 

gctgat'ggtt 

cagattaggt 

caaggagcaa 

acaaacatga 

atttaaagga 

tttgggagca 

atccagctgg 

ttgatgtttt 

ttttaaaact 



attgccggga 
gaggaatgta 
gagctttcaa 
cagcgatggg 
agatcatact 
ccatcagcca 
agtaaagtgc 
gggaaagtta 
tccatatggt 
atctactaaa 



ggatggatgg 
gagagactgt 
tctctcttta 
taggcacatt 
gggaaactga 
tattggcata 
gtaacataaa 
tagahttcag 
ttctgttttc 
ttccatctac 



ctggcgcgcg 
gtggggtttg 
aaaggggcgg 
caaaaccaga 
ggccagagac 
catttgtgaa 
tttttttggc 
ccttggatga 
ccccagctct 
ccac 



aattgttgct 
tgtgacactg 
cgaagaggac 
aaaaattgtt 
ccgggaagaa 
taacacacaa 
catgcttaaa 
tcagtttagg 
ccatttgaac 



tttgctcttg 60 
cggtgccttg 120 
ccgcaggctg 180 
aacaaaccaa 240 
aataaaaatc 3 00 
tttcctgaca 3 60 
gtagaatgag 420 
atttcgagaa 480 
cagctgcttg 540 
574 



<210> 27 

<211> 1969 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 001496 . 2 :2000MAY01 



<400> 27 

ggggcgctgg 

atccgaggag 
agagacggcg 
gttcagcgta 
gacatttaat 
gatggcccag 
aatcaaaact 
aaaaggaatc 
tcctgatgtg 
gctcatctta 
gatgatggag 
agaatctcct 
aactacagag 
tgaactggat 
tcaggtatta 
aggaagaagg 
actagaagca 
ctgaacttcg 
aagatgagat 
tagaatctta 
tagaagagaa 
aggccaacgc 
acagattatc 
aagaaaaagt 
tgaaggaaac 
aagggttaat 
cacctgaaat 
accaagaagg 
atctacgcaa 
tacagtcaca 
atgctatttc 
ccaataatga 
acagctcctt 



gggtgacggt 
caggcgggcc 
gcggtgtctg 
gagtcgctgg 
gttggatgca 
gttcttcaaa 
gaagtaggag 
ttggattata 
aaccttattg 
ggctgtgctg 
gaatctgttc 
gtctctgctg 
gaactaaatg 
atgcaggttg 
atggaaagac 
catttgcagc 
gccaaagatg 
gcaacagaat 
cgacgtgctg 
taaaaagaag 
gaataccatg 
agcgcgaagt 
cgaagaatca 
tgacagtctt 
cattgaagag 
gcctcttgga 
cagggagaaa 
ttcggacaat 
gaatgaactg 
agttgaagaa 
agtccttcta 
actacagaag 
aaaaattgaa 



gcggagccgc 
tgaggccgag 
gccaggcggt 
agcgggcgga 
ccatgccaga 
agatagatcc 
ataattggag 
atcatgagat 

gggagcattc 

tgaactgtga 
aacatgttgt 
gaaatgatgc 
aagctttgtc 
cagcattgca 
tcaatcaatc 
tccagactca 
attatcgaat 
gatgaactga 
agacattctt 
ctagaagacc 
tatatgcaga 
caacttgaaa 
aagaaagcag 
caaaaagaaa 
cttcgttgtg 
agtcaggagt 
cttattcgtc 
gaaaaaatag 
gagacagaga 
ttacaaaaat 
aaaaagaagc 
aagagagcca 
gaattaccaa 



tgccagcgct 
tcagctgcgc 
aggcgctgcc 
gctgtgcgag 
acccgtggaa 
tgcatatttt 
gctaaagata 
tttaggacag 
tgatgcagca 
acagaagcaa 
catgacagcc 
ctatgttgac 
agcaaaggaa 
ggaagagaaa 
tgattctata 
attagaacag 
acgttgtgaa 
ccactttggc 
ctgataaagt 
ttggtgattt 
atactgtcag 
cctacaagag 
ataaactaga 
aggacaggct 
tacaagctca 
cttcagacag 
ttcagcatga 
ccttattgca 
ataggctggt 
ctttacagga 
ttgaagaaca 
ttattgaaga 
gaagctttac 



gggcgagagt 
gggcccccgg 
tggcccgcgg 
agcctcctca 
gatttaacga 
gatgaaaatt 
agcaatttaa 
caaattaatg 
gagcttggaa 
gagtacatcc 
attcaagagc 
cttgatcgtc 
gaaattgctc 
agtagtttgt 
gaagacccta 
ctccaagaag 
gagttagaaa 
agatgaagct 
atctaaacta 
aaggcggcag 
tctagaggaa 
acaggtagta 
ttttgaatat 
gagaacagaa 
agaaggccag 
tctagctgca 
gaataagatg 
gagccttcta 
gaatcaaaga 
tcaaggctca 
tctagagaag 
tctcgagcca 
gaaagaaag 



cggcggccgg 
atcccccgac 
cggggaagat 
cttggatcca 
atggggttgt 
ggctaaacag 
agaaaatttt 
actttaccct 
ggatgcttca 
aagccattat 
tgatgagtaa 
agctgaagaa 
aaagatgcca 
tggcagagaa 
acagtccagc 
aaacattcag 
aaggagatct 
cagtctctga 
gaaggtcaag 
gttaaactct 
gagttaagaa 
gaactacaaa 
aagcggctaa 
agggattctc 
ctcacaacac 
gagattgtta 
ttaaagctta 
gatgatgcaa 
cttctggaag 
aaagcagaag 
ctgcatgagg 
agatttaaca 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1969 



<210> 28 
<211> 2476 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 



15/228 



BNSDOCID: <WO 01 62927 A2_l_> 



WO 01/62927 




PCT/US01/06059 



<223> Incyte II 



LI : 1085273 .2 :2000MAY01 



<400> 28 



cgggccccgc cgggcccatg gacggcgcgg ccgagcgggc gccctgagcg cggtgcgggt 60 
cctcggagca cccccgaggt gagcgcaagc aaggtccggc atcatgtgct aggttattcc 120 
cagtgcgagg ccacacttgg gccgtcggag cagcccctcc tcacttcagg ggtcaccctc 180 
cccaagaccc attgccccat gatggccggg gaccggctcc ccaggaaggt gatggacgcc 240 
aagaagctgg, ccagcctgct gcggggcggg cctggggggc cgctggtcat cgacagccgc 300 
tccttcgtgg agtacaacag ctggcatgtg ctcagctccg tcaacatctg ctgctccaag 360 
ctggtgaagt ggcggttgca gaagggcaag gtgaccattg tggagttcat ccagccggct 420 
gcacgcagcc aggtggaggc cactgagcca caggacgtgg tggtctatga ccagagcacg 480 
cgggacgcca gcgtgctggc cgcagacagc ttcctctcca tcctggctga gcaagctgga 540 
cggctgcttc gacagcgtgg ccatcctcac tgggggcttc gccaccttct cctcctgctt 600 
ccccggcctc tgcgagggca agcctgctgc cctgctaccc atgagcctct cccagccctg 660 
cctgcctgtg cccagcgtgg gcctgacccg catcctgcct cacctc.tacc tgggctcgca 720 
gaaggacgtc ctaaacaagg atctgatgac gcaaaatgga ataagctacg tcctcaacgc 780 
cagcaactcc tgccccaagc ctgacttcat actgcgagag ccgcttcatg cgggtcccca 840 
tcaacgacaa ctactgtgaa aaactgctgc cctggctgga caagtccatc gagtacatcg . 900 
ataaagccaa gctctccagc tgccaagtca tcgtccactg tctggctggc atctcccgct 960 
ctgccaccat cgccatcgcc tacatcatga agaccatggg catgtcctcc gacgacgcct 1020 
acaggttcgt gaaggacagg cgcccgtcca tctcgcccaa cttcaacttc ctgggccagc 1080 
tgctggagta cgagcgcacg ctgaagctgc tggccgccct gcagggcgac ccgggcaccc 1140 
cctcagggac gccggagcct ccgcccagtc ctgccgccgg ggccccgctg ccacggctgc 1200 
caccacctac ctcagagagc gctgccacag ggaatgcggc tgccagggag ggcggcctga 1260 
gcgcgggcgg ggagcccccc gcgcccccca cgcccccggc gaccagcgca ctgcagcagg 1320 
gcctgcgcgg cctgcacctc tcctcggacc gcctgcagga cactaaccgc ctcaagcgct 1380 
ccttctccct ggacatcaag tctgcctacg cccctagcag gcggcccgac ggccccgggc 1440 
cccccgaccc cggcgaggcc ccgaagctct gcaagctgga cagcccgtcg ggggccgcgc 1500 
tgggcctgtc ctcgcccagc ccggacagcc cggacgccgc gcctgaggcg cggcccacgg 1560 
ccccgccggc ggccccggcc ccccgccggc tcccccgcgc gctcccccgc gcacagcctc 1620 
ggcctgaact tcggcgatgc ggcccggcag actccgcggc acggcctctc ggccctgtcg 1680 
gcgcccgggc tgcccggccc tggccagccg gccggccccg gggcctgggc accgccgctt 1740 
gactccccag gcacgccgtc gcccgacggg ccctggtgct tcagccccga gggcgcacag 1800 
ggggcgggcg gggtgctgtt tgcgcccttc ggccgggcgg gcgccccggg accaggcggc 1860 
ggcagcgacc tgcggcggcg ggaggcagcg agggctgagc cccgggacgc gcggaccggc 1920 
tggcccgagg agccggcccc ggagacgcag ttcaagcgcc gcagctgcca gatggagttc 1980 
gaggagggca tggtggaggg gcgcgcgcgc ggcgaggagc tggccgccct gggcaagcag 2040 
gcgagcttct cgggcaacgt tgaagtcatc caagttgtct ctgacccttc ggctgccttc 2100 
ggccccgccg cccgcagcca ggcccgtaat aaatgtatat tatatataat gcaaagaaag 2160 
gtaaatggtt taactgggaa ttttaatccg agaagtaaaa taatttcgat tttttattta 2220 
tttaagctgt tcattctggc aatggatttg gcaacagtgc gggtggtcct cgagcttcca 2280 
tttttactgt ctggtattta aactgaaaca tacgtttcta agcaatacga ggccaccttc 2340 
cagtcgcaag ctgggtgcca ggcctggggc ccctcccagt tcccccgccc caggaaacac 2400 
tgctgacctt tgcaaaggct gccgagcttt cgtgcacttt ttacataaca aaaaggtgaa 2460 
aaaaaggaaa aaaaaa 2476 

<210> 29 

<211> 2689 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI :333138 . 2 : 2000MAY01 



aggagctgga gtgggagctc aagcaggatt cttcccgagt ccctgggtaa gacaaccctg 60 
gcttcttttc ttggctttag agggtctcct tgcttaatgg gaagcgtgca gcacctagtg 120 
agtggatttg aagagccact ttgtaagcaa cttgggcatt tatttcagcc ccagttccag 180 
tcttccctga ctcttttggc atcaaggcat cctcagaagc ttcaactctg gaggcaatgg 240 
gtcgaaagga agaagatgac tgcagttcct ggaagaaaca gaccaccaac atccggaaaa 300 
ccttcatttt tatggaagtg ctgggatcag gagctttctc agaagttttc ctggtgaagc 3 60 
aaagactgac tgggaagctc tttgctctga agtgcatcaa gaagtcacct gccttccggg 420 
acagcagcct ggagaatgag attgctgtgt tgaaaaagat caagcatgaa aacattgtga 480 
ccctggagga catctatgag agcaccaccc actactacct ggtcatgcag cttgtttctg 540 
gtggggagct ctttgaccgg atcctggagc ggggtgtcta cacagagaag gatgccagtc 600 
tggtgatcca gcaggtcttg tcggcagtga aatacctaca tgagaatggc atcgtccaca 660 



<400> 29 



16/228 



BNSDOCID: <WO 01 62927 A£_L> 



\VO 01/62927 



gagacttaaa gcccgaaaac^BRjctttacc 

tcactgactt tggtctgtcc aagatggaac 
ccccaggcta cgtg^ctcpa gaagtgctgg 
gctggtccat cggcgtcatc acctacatat 
aaacggagtc taagcttttc gagaagatca 
tctgggatga catttctgag tcagccaagg 
cgaacgaggg' ggtaacactg tgagaagggc 
gccctccacc gggacatcta cccatcagtc 
gagcaagtgg aggcaagcct tcaacgcagc 
acatgaacct gcacacgccc gggcgtccgc 
caagcctcag aaacctctag acccagctcc 
ctggaccaca gtgtagcact ccctgcctct 
cggcccactg cccttggtgg caggtccctc 
age age age c tggtgcccat gcatcagggg 
ctgaacattg ggagcaaagg 
gecaacaaaa aacagaactt 
cttccactgc egggcaggge 
gectatgtea cgtgcaattt 
gtgtctatcc ggcagaggga 
ttctgggcca gaagcaccat 
aggagggagc cccaaggcgt 
caccagcttc cagtgtctcc 
cgtggctctg tgcagtgtta 
gaaaagetta aatgggctgg 
geccatatgt gagcatctat cccagagatc 
taaagtctcc aacctcttgg ecagattgeg 
catgaatacc agccagctcc ccatggtggt 
ttaactccag gattagctcc 
ttctccctgc ctcaatctaa 
tcatgagtaa gaggctgeaa 
tgcacagcag cccaaatgga 
ggaggttcta agtcctgcct 
tccatatctt ggtccgttcc 
atgetcagag ttgtgcttaa 

<210> 30 

<211> 1391 i ; 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 338927 . 1 : 2000MAY01 



tccaagctgc 
cctcaaaaag 
cagtggcatg 
tggagcctgt 
tccaaacctg 
caggagcagt 
taggaggece 
aagcggtgcc 
ctacagtggc 
tgtttgtcgt 



caagtgeget 
aagcagtgcc 
cgttgaacta 
cgttaacctt 
ctggttcccg 
ccaggtatgt 
taaaatggac 



ttacccctga 
agaatggcat 
cccagaaacc 
tgctctgtgg 
aggagggcta 
actttatttg 
ttgagtcatc 
aggegtccag 
agctgtggta 
ccagaggtgg 
cctgagatca 
gacccaatat 
aactgectgg 
tccctggccg 
aaagtcctcc 
caagteggag 
agactggagt 
tccaggagac 
ggaaggcaga 
gcctgcttgc 
agaagecttg 
cgtgacctgg 
cgttagatag 
gecaggctgt 
ecactctgtg 
ctcattaatg 
ctccctggtg 
gagaaccaag 
acaccctcca 
cttagtccat 
ggaagttgac 
ttttctctcc 
cctcggcccg 
atatttttct 



agagaactc 

catgtccact 

ctacagcaag 

atacccccca 

ctatgagttt, 

ccacttgctt 

cctggattga 

atccagaaga 

gcacacaatg 

agaacaggee 

ccatcaccga 

accctgccca 

tcaatggctc 

ccgggccctg 

tactgetctg 

gtcatggtac 

ctgtctcatt 

atattcaact 

gcaagtggag 

cagctggggc 

gttgaagctg 

cctgctctat 

ctctcgccct 

gtcaccttct 

cacactctct 

gtcgttgcct 

agctcttcaa 

caagcacact 

aaggtggaat 

gagtccagca 

ttatatttaa 

ttgaaatgtg 

tcgcaccccg 

ctaaaaaaa 



PCT/US01/06059 



gat cat ga 
gcctgtggga 
gctgtggatt 
ttctatgaag 
gagtctccat 
gagaaggatc 
eggtaacaca 
actttgetaa 
aggaagctac 
gectgaaact 
ggcacctgtc 
gcatcgccgc 
cctccacatc 
tggctgctgc 
agcccacact 
eggttaaage 
atgtgattcc 
cctctgctcc 
cagggcttag 
agccccgtca 
tgagcaggag 
gccccacacc 
gggtctgtgc 
ccaaagcaaa 
ctcacatctc 
gcccatctgc 
gttctaatcc 
ttcttggcec 
agaaagaagt 
ctgtcctccg 
gtgtctgcca 
ccagtcgaca 
tccaggcttc 



720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2689 



<400> 30 

gtgagecgae 

atctgtccca 

aegggtaget 

ctggcatctg 

cccagaggcc 

ccaggaccac 

agcaggaggt 

ttgccagcct 

ggagctgett 

tgtttttcca 

tcagggtcat 

cagagctgea 

accacagccc 

cctaaggcgc 

cacagtgaag 

ctggggaaag 

tgaaaataag 

attaactccc 

gcatagtacc 

ttacttgect 

agaagaatgc 

tacctggggc 



cttctgagcc 
ggccagcgtg 
gagatccacc 
tgtccaacct 
agaggctaga 
gggagaagac 
acctgaactc 
gtgaccctct 
ggagacatga 
ggagctgctg 
cttctccaac 
gcggcgcctg 
tggccaggtg 
caggccctta 
ggcttgggtg 
gggtggatca 
ccagtgccaa 
tcaatcttca 
ctcttgtttt 
tatatagttt 
tagtaaattg 
atgcatattt 



ctcaagaaag 
gctgagtgtg 
ccgagaaacc 
ggtcactgtg 
ggacgtgcat 
gaatgtcggg 
cctgaagaac 
caggateggg 
cgcaggccat 
aagacagccc 
atctcctcca 
gacgactggt 
gaggttgggg 
attgtgtggc 
aatgagcttc 
ggtcctggcc 
ctgagggctt 
catcagcctg 
ctacagatgg 
gecaegttag 
ttaaacttag 
tttaagagta 



atcagaacag 
ctggcctgga 
ggcaggatga 
tttgagaata 
caccgacctg 
gaggccgtgg 
aagctgtcca 
gaegcaggag 
atgtggcgcg 
gcagcagcaa 
tctatcagtt 
gaggtccacc 
agaccccagg 
catgcctccc 
tgagaggcat 
aagccaagtg 
c-ctacacgcc 
ctgtgtgtat 
gcccacttat 
ttttggatgt 
tgatgtattg 
ccccctggag 



attcatgggt 
ggcctctctc 
agggggcaag 
gcaggacccc 
agtgcaggcc 
ggtctgagee 
gcgaagcctg 
ccagagaaga 
cctccacctg 
ggccttccca 
ccattctcag 
aggagcccct 
gcgctttaag 
tggggctegg 
gtgtgtgtgc 
accaagtgac 
tggccttggg 
gtaeggctat 
agctattgta 
cctttataga 
attacaggcg 
tttaatatat 



gatttagect 
tetgettcag 
tgaggagaag 
agaagcagca 
tcccgagtcc 
caggacagtc 
gaggaaatcc 
agategtcca 
ctagaccagg 
gaggatgtgg 
ttcttccbcc 
gaggectcag 
gttcaagect 
tcactgtctg 
agactgette 
tttgggacag 
ctttccctct 
catggttatg 
acagtgttag 
tctttggcaa 
tgagcoacca 
agcctgggtt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 



17/228 



BNSDOCID: <WO 0162927A2_I_> 



WO 01/62927 

acctgggatg aag, 
tgccagtcat g 



ag agagcaaact accccaagca atg< 



PCT/US01/06059 



gc ccctgtcaag 1380 
1391 



<210> 31 

<211> 1886 

<212> DNA 

<213> Homo sapiens 

<220> 

<22i> misc_feature 

<223> Incyte ID No: LG: 335558 . 1 :2000PEB18 



<220> 

<221> unsure 
<222> 1844 
<223> a, t, c, 



g, or other 



<400> 31 

cattagcgct 

agcgcctctg 

gattttccgc 

ccacagcctc 

cctcttgcgc 

actcctgctc 

agctatgatg 

gtgtcacttt 

ctaaataact 

cctcagcctc 

ccacaccacc 

caccatccat 

ccagtatcta 

acttctgctg 

agtactacag 

taacattact 

taatataagt 

cagtggtgtt 

tgctgttagc 

gcagcaacaa 

tgagcccttt 

acctgctaca 

agaagtgact 

actgagtcac 

gcagcagcag 

cc tccaacag 

acagagtatt 

tcagcaaaag 

tatccagcca 

catttccagt 

tccccttcca 

ttctacacag 



aggaagatgc 
ctctcaatgc 
ctccgtcgct 
cgcctccaca 
caggcggaac 
agatctccgc 
atctggatga 
ccagggctac 
tccaggaagc 
atttgcctca 
tccatcacca 
ctcatgttgc 
gaaaactctc 
tatcatccag 
gtggaactag 
gctgtgggta 
acaagcaata 
aatgtgaata 
agtgttccta 
ccaacagtta 
aaaaaaggta 
gaaggtgtgc 
tctgaaaggg 
tatacagaga 
cagcagcaac 
atggattttg 
tctcagtcac 
caaggtcttc 
tcgcctgtaa 
ttggctcaac 
ggggcaccac 
atggccccag 



gcacccggca 
agcaggtacc 
get t cage eg 
aagectgaac 
tcaaatgaaa 
tagtatcagc 
atctcacacg 
tgacttaggg 
cgagacacct 
ccttccacaa 
ccatcagatt 
tgtggccagt 
tacaactgga 
tggttcacct 
gtataaattc 
gttttaatcc 
ttcctagtgc 
tcttgagtgg 
atgcagctgc 
acacttcgag 
gatggacttg 
tgataaataa 
agagcactag 
gtgtgggaag 
aacgactact 
gtagcactgg 
agatctcaca 
agccagtacc 
atgtggttgg 
cccagctacc 
caccccaaca 
gcatga 



atgttccctc 
ggcgtcggta 
ccgccccctg 
ctcctttcgc 
aagaaaagtg 
tctaacaaca 
gaagatctct 
gagcccgaac 
ggggcagtct 
cagaatgttg 
catcatgggc 
gcatccatta 
agctctgaca 
gcatctgtaa 
tgttactggc 
taatgtgaca 
tgctggtgtg 
catgggcaat 
agggatgact 
gttcagagtt 
cactgagttc 
agtggtggag 
tgggagttca 
tggagagatg 
acagcaacaa 
tccacagagt 
agtacaatta 
tctgcaagcc 
tgtaacttca 
atattctcag 
gttacagtat 



gaaggggcag 
gtaatgecac 
cagcatcttc 
aggctcagct 
gcttccagat 
gtatagcaga 
ettcttegga 
gcagctcctc 
ctcccaacca 
tgatcaatgg 
accacctcca 
ctggtgggcc 
gtatcacacc 
tgactaatag 
actagtacag 
ageagcatge 
agtgttgggc 
ggtactattt 
gggggategg 
gtgaagttag 
tatgaaaaag 
actgtaaagc 
gtgagcagta 
ggagccccta 
ccagctctcc 
attccagcag 
cagtctcaag 
actatgagtg 
gctttaggtc 
gcggctcctc 
gganaacagc 



cggtagtggc 
atcttccgag 
tacgtcggga 
gcaggcacag 
aactagegtt 
ggacactgag 
gatccttgat 
agaagagacc 
gccccacctt 
gaatgetcat 
acatggtcac 
accctcaagc 
agttgcacca 
gcgtgctcca 
taaataatgt 
ttggtaatgt 
ctggagttac 
cttcctctgc 
tttcaagtca 
attctagttc 
aaaatgctgt 
aaaatccgat 
gtgtcagcac 
ctgtggtggt 
aaggtgtgac 
ttagtatacc 
aactgagcta 
ctgcaactgg 
agcagccttc 
cagtgeaaac 
aaccaatggt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1886 



<210> 32 

<211> 1540 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 
<223> Incyte ID No: 



LG:998283 .7 :2000FEB18 



<400> 32 

ctttactgta 

tgtactaact 

gatgtgtaca 

tgtatctgea 

aattggtacc 

tatgtctgtc 



tctgtaaaac 
ggtatcatgg 
tctgtaatga 
gaacacctta 
atgggcgctg 
cacagtgcca 



gecttatgat 
agaatgtgtt 
ttgtaaacgg 
tgatgagtca 
cgttggcatc 
gtcaacagag 



gaatctaagt 
ggcatcacag 
gcacaagagg 
caattttata 
ttgcaaagtg 
gatgecatga 



tctatattgg 
aaaaggaggc 
gcagcagtga 
ttggctgtga 
aggcagagct 
cagtgctcac 



ctgtgatctt 60 
taagaaaatg 120 
ggaattgtac 180 
tcggtgtcag 240 
cattgatgag 300 
gccactaaca 360 



18/228 



BNSDOCID: <WO 01 62927A2_I_> 



WO 01/62927 



m c 



gagaaggatt atgaggggtt^P&gagggtg 

tggcctttcc ttgaaccagt agaccctaat 
gaacctatgg accttgccac catggaagaa 
acggaatttg tggcagatat gaccaaaatt 
gactccccat tttaccagtg tgcagaagtt 
ggcttcaaag ctagcaggtc ' tcataacaac 
gcgtgttaac' ctaacataaa acacagcaag 
ggagccagat gtttt'tagtc aggctatcct 
tggtcataac agtccaatta tattcttggc 
aagtcaacga caccattatc ttgtcaagat 
agaaaatttt gtttattaaa aaaaaaaaag 
gggtcaagta taactccatg gaaatgccac 
agtctcaaag aaaacttttg actgtattta 
tgctgccctc atttgtcaac taattatttt 
aatccatcca tatcttatca atgattcctg 
tcttttcttt ctatgaaagg agctgctatg 
tcaacaatga gtttattgtt catggtagat 
agtggtcctg gactacagac tctggtgact 
ccacaccagg ttgaaatgag taaaatctat 
agattctaat ttttcttttt gtattaaaat 

<210> 33 
<211> 613 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 402739 . 1 : 2000FEB01 



ctccgttcct 
gatgcaccag 
agagtacaaa 
tttgataact 
ctcgaatcat 
aaactgcagt 
aatctggttg 
gacaagactt 
caattttgtc 
cagatggttt 
aaaaagaaag 
gtctgctctt 
tttattgttg 
ttcttataaa 
tacgtaaaag 
tacacatgtg 
taaaattaag 
tgaatataac 
ttgaaggtat 
tcaactatgg 



PCT/US01/06059 



tacaggcccs^BTagatggcc 

attattatgg^gttattaag 
gacgatatta tgaaaagctg 
gtcgttacta caatccaagt 
tctttgtaca gaaattgaaa 
ctacagcttc ttaaagttca 
tctgaactat tttaaattaa 
gacctaaact tcgtttttat 
caacggacaa gaaaaaagca 
tactatttgt ggcagaagcg 
caagaaacaa tgatactgtg 
cagtgaagaa gctggtttag 
caaaaaagat gcttttttat 
atccagccac ggttacatat 
tacaagacaa cctctagatg 
cacacacaca caactgcgaa 
cttgcataaa ggttgggcta 
agtacaattt gtcaattact 
cttctttgta aacatttgtc 



420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1540 



<400> 33 

ccgccgtggg 

aggacgatga 

cagaaacaga 

cgcctcaacg 

tccaagctgg 

ttcttccaag 

gaacgaaggc 

actcagttcg 

ctgtagtgat 

tgggtgttag 

tttataaaca 



ggcagagaag 
tcccgccggg 
ggcccgccgt 
ccgagttgga 
acaagctttc 
gacaggggtt 
ccatgtcgtt 
gtctgcatct 
gtgtgaggac 
ttgtaaattc 
aaa 



tccaacccct 
ggagtgcacg 
gggggcagag 
ccacctggcc 
tgtcctgcgc 
ggcagtggca 
caggaggccc 
gttgatggtg 
aatgaaggca 
atggttacat 



ccaagcgaca 
tacgcgggcc 
aagtccaacc 
agcctgctgc 
ctcagtgtca 
gatgccgaag 
cgagcattgg 
aatattgcag 
atggtgattc 
aacataataa 



ccggcgggac 
ggaagcggag 
cctccaagcg 
cgttcccgcc 
gttacctccg 
atgtggatga 
acacccaggc 
gactgatagc 
acaggcaggc 
atttacctac 



atcaaggccg 60 
gaggcccctg 120 
acaccgggac 180 
tgacatcatc 240 
ggtgaagagc 3 00 
ccacacgggc 360 
cctgagacgt 420 
cactgaccgc 480 
cagcaccata 540 
gaattcatta 600 
613 



<210> 34 

<211> 507 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: LI : 175223 . 1 :2 000FEB01 



<400> 34 

gcacattaga 

cagaagatca 

cgtggggcaa 
tgccggagcc 
tcaatgtgcg 
atattccagt 
taaaagaagg 
tacgggtaac 
agacactaca 



ccatgcgagc 
ctccgttcca 
catggccgaa 
ggcagaggag 
catgggattt 
cgatgtattt 
agaaccagtg 
aggacctggt 
gaaaagaaaa 



taaatttgtg 
aagggaaagt 
ggcggggcta 
gaatcccagg 
ggattcatct 
gtacaccaaa 
gaattcacat 
gggagcccct 
ccaaagg 



atcgcacaaa 
tttcatctca 
gcaaaggtgg 
ttttgcgcgg 
c cat gat aaa 
gcaaactatt 
ttaaaaaatc 
gtttaggaag 



atcaagatgt 
cgagtttgga 
tggagaagag 
aactggccac 
ccgagaggga 
catggaagga 
ttccaaaggc 
tgaaagaaga 



tagattgatg 60 
gc tgagggcc 120 
cccgggaagc 180 
tgtaagtggt 240 
agccccttgg 300 
tttagaagcc 360 
cttgagtcaa 420 
cccaaaggga 480 
507 



<210> 35 
<211> 653 
<212> DNA 

<213> Homo sapiens 



19/228 



BNSDOCID: <WQ 0162S27A2 I > 



WO 01/62927 _ PCT/US01/06059 



# 



<220> 

<221> misc_feat? 
<223> Incyte ID No: LG: 981076 . 2 :2000MAY19 

<220> 

<221> unsure 
<222> 642 

<223> a, t, c, g, or other 
<400> 35 

gggcaacccg tctggaggca gcgcgcccgc gggcagcctc gttgtggctc cttgggctct 60 
gttggcggcg gcgctagctt cggagtctcc cgcgcgcacc tcagccgcct cctagcggcg 120 
cggcgctcgc tcctacgcct aaaatgacca atgtgtgatt tcagtggaat aaatggcgtc 180 
caaagtcaca gatgctatag tctggtatca aaagaagatt ggagcatatg atcaacaaat 240 
atgggaaaaa tctgttgaac agagagaaat caaggggcta aggaataaac caaagaaaac 300 
agcacatgtg aaaccagacc tcatagatgt tgatcttgta agagggtctg catttgcaaa 360 
ggcaaagcct gaaagtcctt ggacttctct gaccagaaag ggaattgttc gagttgtatt 420 
tttccccttt ttcttccggt ggtggttaca agtaacatca aaggtcatct ttttctggct 480 
tcttgtcctt tatcttcttc aagttgctgt aatagtatta ttctgctcca cttctagccc 540 
acacagcata cctctgacag aggtgattgg gccgatatgg ctgatgctgc tcctgggaac 600 
tgtgcattgc cagattgttt ccacaagaac acccaaacct cntctaagta bag 653 

<210> 36 

<211> 589 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> incyte ID No: LI : 1008973 . 1 : 2000MAY01 
<400> 36 

cacatcccga ggaacagaag cgggcctgct cagcatgtct gcaaggaccg gcgttccaga 60 

ccaagatggc ccgctgctcc gaggacccag ctcgctccag gctgtgactc cacatcccga 120 

ggtcgccgcc tggcgaccgg gcagcgagga ccggccaccc cagactgcct gtgccgccgc 180 

cccgagctcg gacagagccg cgaccgcgag gacagcgacg cctgcacagc tctggcgaga 240 

tggcggcagg gtccactacg ctgcgcgcag tggggaagct gcaggtgcgt ctggccacta 300 

agacggagcc gaaaaagcta gagaaatatt tgcagaaact ctccgccttg cccatgaccg 360 

cagacatcct ggcggagact ggactcagaa agacggtgaa gcgcctgcgg aagcaccagc 420 

acgtgggcga ctttgccaga gacttagcgg cccggtggaa gaagctggtg ctcgtggacc 480 

gaaacaccgg gcctgacccg caggaccctg aggagagcgc ttcccgacag cgctacgggg 540 

aggctcttca ggagcgggaa aagggctggg ggcttcccag aaaacgcga 589 

<210> 37 

<211> 1315 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc__f eature 

<223> Incyte ID No: LI : 1190250 . 1 : 2000MAY01 
<400> 37 

tagggaatgg ctggggttgg ctgaagagcg cacagtggag ttttaatgtc cgccatgttg 60 
agccatggcg tattgaagag agcgagcgag agaggagcgg agcggcacag cctcccaccc 120 
tcccggctgg tgttagtgcc cggacggcgg gctctgcgct ccgcccctca agtccccggc 180 
agcggttggc gagtggggac cgaacccccg gttctccatg atcccgctgg ccggggccgt 240 
ttcccccaga gcggagaggt atctgctgcg cctgagatga gtaaactgtc gtttcgggcg 300 
cgggcgctag acgcctcgaa gccgctgccg gttttccgct gtgaggatct gcccgacctg 360 
cacgaatacg cctcgataaa cagggccgtg ccgcagatgc ccaccggaat agagaaggaa 420 
gaggagtcgg aacatcatct tccagcgggc ttatttcagc acagcaggtg tatggcgaga 480 
agagggattt atatggttat accggtccca gaggcagaaa gtaattattg cttactatga 540 
r agtctataat atcctgggga atttaagatg ccaaaagcag cttcatttca catacagcct 600 
tttagtttgg atgctgaaca gcctgattat gatttggatt tctgaagatg aagatatttg 660 
tgaatataac tgttaaaaga aaaatggaac acttctgccc ccttgcccat ttgtaggcaa 720 
atggattgac ccccctagaa aaaggcagtg gtcagcagcc agtcagtctg caggaagcca 780 
aactactgct aaaagaagat gatgaactaa ttagagaagt ttatgaatat tggattaaaa 840 



20/228 



BNSDOCID: <WO 0162927A2_I_> 



WO 01/62927 

agagaaaaaa ctgtcgaggg^^ptctctta ttccatcagt 
gttccagcac aaatgatcct tatgtggctt ttagaaggcg 
gaaaaaatcg caaaaatgat gaagcctctt acgtaaatat 
taagtcgagc tgttactatt ctagagatga taaaaagaag 
ctattgcact taacactgga aattatggaa 'aagaggtata 
gagatcatgt ctgaggttat ggcacagaga cagccaatga 
atcatcccta ttactatata gcagtctatt taaacaccat 
gagttcaaat gttaataagc aagattatag tccgatctta 

<210> 38 

<211> 1616 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 021371 . 3 : 2000FEB18 



ia atgc 



PCT/US01/060S9 



a aaa c a agagl^ppc gagat g 
tactgaaaaa atgcagactc 
gcttaagctg cgacgagatc 
agataaaagt aaaaggagag 
atttgggcga ctacaatgga 
aacctactta tgccatcccc 
ggaagcatat ggatgtgaag 
tccgaccgat acggt 



900 

960 

1020 

1080 

1140 

1200 

1260 

1315 



<400> 38 
gtgacagagc 
gctctaatcc 
ccatgctgcc 
tctgtgatga 
tcctggtgtg 
gcctctagtg 
cagctcctca 
agctggtgtg 
cactctgccc 
gtactattac 
, tcgctttctg 
ccagctcttc 
gcacatgttc 
tgtctaccgg 
gccagatccg 
catgcagaaa 
tgacaagtgt 
catccaggct 
caagaaggca 
cttttgtccc 
cacaggggcc 
cctgcgcctg 
ccctgaggag 
gaagcagcag 
acccccagag 
cctgggcggc 
gcagacagcc 



cgtgcctgga 
aggccaggtc 
ctctaaccac 
gtgggtggga 
tcagtggctg 
agttcccaga 
ggcccaggac 
agccagtcag 
cggcgccgag 
aagtcgccca 
tcccacgagg 
aagtgcctgc 
aaccacgtgg 
aaggacgtca 
actccaaagc 
agactcacgc 
ggaaagtcct 
gttgccaacc 
ctgctgaacc 
taccgcacct 
aagcccttcg 
cacgtacggt 
cccccctccc 
cacagtgcgg 
gcgacaactt 
gccaccatca 
ttggatcttc 



acccaggtct 
ctactgtgct 
aggctgtctg 
ggatattttg 
gcttgtgacc 
gtggacagag 
acctggtggc 
atgcagagaa 
gtcgaccttc 
aacoactttt 
acctgcgctt 
agtgcagcta 
gcagcaagcc 
ttcggcacgc 
tgagctcttt 
agcacatgaa 
ttaagaagcg 
gcaggttcaa 
accagttgtc 
tccgagagga 
cctgtgagta 
gccgacacgc 
gccgtcgccc 
cccctggacc 
tccagtcatc 
tctaccagca 
tgctgaacat 



cctgactggg 
gacagaatgt 
tgtcatgtga 
gctggcctct 
tggcatgtcc 
ccctccagag 
catgggcaag 
cgcagccccc 
caggcgcttc 
gaggcccttc 
ccacgtcaac 
tcgttcccgc 
ctacaagtgt 
cgctgtgcac 
cccctgccct 
gacgcacagc 
ctacaccttc 
gtgtgagttc 
ccacgtcagt 
cttcttgctg 
ctgccacttc 
aagcagcttc 
cttcttctct 
acctcccagt 
tgaggctccc 
aggagctgag 
gagtgctcag 



aggtcccagc 
cggctgttgg 
ctcaatgcac 
ggcccacttg 
gtggcaggtg 
ccacaggatc 
gtgagcagga 
tcctgcccgg 
ctaggcaaga 
ctgtgccgca 
tcccatgagg 
cgctggtcct 
gacgagtgca 
agccgggacc 
gtgtgtggcc 
actgagaagc 
aaaatgcacc 
tgtgagtttg 
gacaagccct 
tcccatgtgg 
agcacacggc 
gaggaatggg 
ctgcagcaga 
tccccaggac 
tcattgctct 
gagtcgacag 
cgggaactgg 



tctcgccgtt 
gcggtagtat 
ccatcagtgg 
tccacgtctg 
tggaaggaga 
ccgaggctcc 
cccctgtgga 
atgagcatga 
aataccgcaa 
tctgtggttc 
ctggcgatcc 
cgctcaagga 
gctacaccag 
ggaagaagag 
gtgtgtaccc 
cccacatgtg 
tgctcacgca 
tttgtgaaga 
tcaaatgcag 
ctgtcaagca 
acaagaagaa 
ggaggcgcca 
ttgaggagct 
-ctcctgagat 
gttctgacac 
cgatggccac 
ggggca 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1616 



<210> 39 

<211> 521 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 
<223> Incyte ID No: 



LG:475404 . 1:2000PEB18 



<400> 39 

ggcggttgct 

ttcttggctc 

agtattggga 

ggcttttgag 

tcagaagaat 

aagccaatgg 

aaatcatatc 

cagccaaatt 

atgcagtatt 



ttgtctctcc 
tgggaggccc 
catccctagc 
gatgtggctg 
ctctacagag 
aaagaccaga 
gtacagagac 
ccaaatcttg 
tgtggaaaag 



ttcgccggag 
aggtggctct 
tgacgccagg 
tgaactttac 
aagtgatgca 
atattgaaga 
tgtgtgaaag 
atctgaacga 
tctttgtacg 



ccggctcttc 
gcagcagcct 
acacccggga 
ccaggaggaa 
ggaaacctgc 
tcacttcgaa 
taaagaagat 
gaacatttct 
tcattccctc 



ctgttagtct 
ctgccaccct 
agccgaggaa 
tgggctttgc 
aggaacctgg 
aaacctggga 
ggtcagtatg 
actggatrtaa 



ccgctgctag 60 
gtgacctgcg 120 
tggactcggt 180 
tagattcttc 240 
cttctgtagg 300 
aagatataag 360 
gagaagttgt 420 
aaccatgtga 480 
521 



21/228 



BNSDOCID: <WO 0 1 62927A2_I_> 
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<210> 40 
<211> 848 

<212> DNA , 
<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: LG: 979406 . 2 :2000FEB18 
<220> 

<221> unsure 

<222> 407, 509, 590, 645, 689, 707, 829 
<223> a, t, c, g, or other 

• . i 

<400> 40 

aattcactgt ctgtagcatc tgctcctcca cagagggacc ctggaatggc gatggcactc 60 
ccgatgcctg gacctcagga ggcggttgtg ttcgaggatg tggctgtgta cttcacaagg 120 
atagagtgga gttgcctggc ccccgaccag caggcactct acagggacgt gatgctggag 180 
aactatggga acctggcctc actaggcttt cttgttgcca aaccagcact gatctcccta 240 
ttggagcaag gagaggagcc gggggccttg attctgcagg tggctgaaca gagcgtggcc 300 
aaagccagcc tgtgcacaga ggaccctaat acactgccca gcagaagcca ggaaggaagc 360 
cctgccagct cagaaggtgg gccaggagag aagggtgtgg ccggaanggt agccggtggg 420 
ggtgctgcat cctcttggcc ccacggggag caccctgtca cccctaacag atgaagagaa 480 
ggtgaggggg gattgagtta cctgcccant ggtgtgtcaa actccatgat gggggtggga 540 
acatgcagac ctggcccctc caaggcccac tgctccccgg gctgtgcgcn ccacagtctg 600 
tgccaggggc tgcagaaaga gccatgggaa gcctggccct ggtanggcac caagatgggg 660 
agggctggat ccaccctcca gagaacaana atcctgggca gtagccngga catggtgggg 720 
gaagctgaag ggttgtccct gagcatcccc aacatggagc tcggaagtct gggccatggg 780 
caggaaagca cgaaggtcag ggccagcctg gggaaccaag agctgtgcna cgatcccttc 840 
cacctggg 848 

<210> 41 

<211> 1307 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> irtisc_feature 

<223> Incyte ID" No: LG:410726 . 1 :2000FEB18 
<400> 41 

tgctgttgcg gatgctgatg cgtttctgca ggactgtaaa ctggattcct ggaacctttg 60 
atattcctgg ctgtgtatag tgcctgttgg tggactgtac tgatactcaa ctagagtgtg 120 
aagggactgg attcctgccc ctgagacaca atgcaagctg tagtgccctt gaacaagatg 180 
acagccatct caccagaacc tcaaactctg gcctcgactg aacaaaatga ggtcccaaga 240 
gtggttactt ctggggaaca agaagctatt ttaagaggaa atgctgctga tgcagagtct 3 00 
ttcagacaga ggtttaggtg gttttgttac tcagaagtag ctggacccag gaaagctctg 360 
agtcaactct gggagctctg caatcagtgg ctgagaccag acattcacac gaaagaacag 420 
attttagagc ttctggtgtt tgagcagttc ctgaccattt tgectgggga gatcaggatt 480 
tgggtaaagt cacaacatcc tgagagtagt gaggaagtgg tgaccctaat agaagatttg 540 
acccagatgc ttgaagaaaa agatccagtc tctcaagatt ctactgtttc ccaagaggag 600 
aactcaaaag aggataaaat ggtcactgtt tgtcccaata ctgagtcctg tgaatctata 660 
acattgaagg atgtagctgt gaacttttca agaggagagt ggaagaagct ggagcctttt 720 
caaaaggagc tatataagga agtgctactg gaaaacctca ggaacctaga atttctggac 780 
tttccagttt caaaattaga gttgatttcc cagctaaagt gggttgaatt gccatggctg 840 
ctggaagaag tctcaaaaag ctcccgacta gatgaatcag ctttagataa aataatagaa 900 
aggtgcctca gggatgatga tcatggcttg atggaagaat cccagcaata ttgtggcagc 960 
tcagaggagg atcacggtaa tcagggaaat tcaaaaggaa gagtcgcaca atacaaaact 1020 
cttgggagtg gcagtagggg taagaaattt gacccagatt aaagcccctt tggacatgat 1080 
ttcaaagaaa cttcagacgt aattaaacat ctgagagtct acttgaggaa gaaatctcgg 1140 
aggtataatg aaagcaagaa acccttcagt tttcattcag accttgttct gaccagcaag 1200 
gagaaaaccg ccggagataa gtcacggaaa tctaatgatg gtgggaaagt cctgagtcac 1260 
tcttcagctc gtactgaaca tcagaaacgt cagaagattc atttttg 1307 

<210> 42 
<211> 2456 
<212> DNA 
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<213> Homo sapiens 

i 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LG: 200005,. 1 : 2000FEB18 
<400> 42 

ggagcctggt caacaagcca gcaccactgt gcfcaggaggc caggccggct ctgcagccga 60 
cgcatcccgt gtgajgcagac acacatagct cgctttcttg ggggctcttt gtcagccacc 120 
aacacagaga gcacacactg acacacacgt acacaaagat ctgagccagg ttgtgggagg 180 
gtatgttctg tcttgtgctt tttcttttag aagctactaa agggtgttgg ggatgcttct 240 
gactattatg aaggccaaaa ggcctgttga ctggggctgc ttttaaccct ttcctatttg 3 00 
ctgagaatgc agccgtgtga cagtaactga acattggtct aaagtctttc caaaaggtca 3 60 
aggttcacaa gaacatctgc tcaaattaat gaccatgggg gatatgaaga ccccagactt 420 
tgatgacctc ctggcagcat ttgacatccc agatatggtc gatcctaaag cagctattga 480 
gtctggacac gatgaccatg aaagccacat gaagcagaat gctcacggag aggatgactc 540 
ccacgcacca tcatcttctg atgtgggtgt cagcgttatc gtcaagaatg ttcggaacat 600 
tgactcttcc g&gggcgggg agaaagacgg ccacaacccc actggcaatg gcttacataa 660 
tgggtttctc acagcatcct cccttgacag ttacagtaaa gatggagcaa agtccttgaa 720 
aggagatgtg cc tgcctctg aggtgacact gaaagactcg acattcagcc ajgtttagccc 780 
gatctccagt gctgaagagt ttgatgacga cgagaagatt gaggtggatg acccccctga 840 
caaggaggac atgcgatcaa gcttcaggtc gaatgtgttg acggggtcgg ctccccagca 900 
ggactacgat aagctgaagg cactcggagg ggaaaactcc agcaaaactg gactctctac 9 60 
gtcaggcaat gtggagaaaa acaaagctgt taagagagaa acagaagcca gttctataaa i020 
cctgagtgtt tatgaacctt ttaaagtcag aaaagcagag gataaattga aggaaagctc 1080 
tgacaaggtg ctggaaaaca gagtcctaga tgggaagctg agctccgaga agaatgacac 1140 
cagcctcccc agcgttgcgc catcaaagac aaagtcgtcc tccaagctct cgtcctgcat 1200 
cgctgccatc gcggctctca gcgctaaaaa ggcggcttca gactcctgca aagaaccagt 1260 z 
ggccaattcg agggaatcct ccccgttacc aaaagaagta aatgacagtc cgagagccgc 1320 - 
tgacaagtct cctgaatccc agaatctcat cgacgggacc aaaaaaccat ccctgaagca 1380 
accggatagt cccagaagca tctcaagtga gaacagcagc aaaggatccc cgtcctctcc 1440 
cgcagggtcc acaccagcaa tccccaaagt ccgcataaaa accattaaga catcttctgg 1500 
ggaaatcaag agaacagtga ccagggtatt gccagaagtg gatcttgact ctggaaagaa 1560 
accttccgag cagacagcgt ccgtgatggc ctctgtgaca tcccttctgt cgtctccagc 1620 
atcagccgcc gtcctttcct ctccccccag ggcgcctctc cagtctgcgg tcgtgaccaa 1680 
tgcagtttcc cctgcagagc tcacccccaa acaggtcaca atcaagcctg tggctactgc 1740 
tttcctccca gtgtctgctg tgaagacggc aggatcccaa gtcattaatt tgaagctcgc 1800 
taacaacacc acggtgaaag ccacggtcat atctgctgcc tctgtccaga gtgccagcag 1860 
cgccatcatt aaagctgcca acgccatcca gcagcaaact gtcgtggtgc cggcatccag 1920 
cctggccaat gccaaactcg tgccaaagac tgtgcacctt gccaacctta accttttgcc 1980 
tcagggtgcc caggccacct ctgaactccg ccaagtgcta accaaacctc agcaacaaat 2 040 
aaagcaggca ataatcaatg cagcagcctc gcaacccccc aaaaaggtgt ctcgagtcca 2100 
ggtggtgtcg tccttgcaga gttctgtggt ggaagctttc aacaaggtgc tgagcagtgt 2160 
caatccagtc cctgtttaca tcccaaacct cagtcctccc gccaatgcag ggatcacgtt 2220 
accgacgcgt gggtacaagt gcttggagtg tggggactcc tttgcacttg aaaagagtct 2280 
gacccagcac tacgacagac ggagcgtgcg catcgaagta acgtgcaacc attgtacaaa 2340 
gaacctcgtt ttttacaaca aatgcagcct cctttcccat gcccgtgggc ataaggagaa 2400 
aggggtggta atgcaatgct cccacttaat tttaagccag tcccagcaga tcaaat 2456 

<210> 43 

<211> 738 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID NO: LG : 1076828 . 1 : 2000FEB18 
<220> 

<221> unsure 

<222> 74, 161, 279, 339, 346, 362 
<223> a, t, c, g, or other 

<400> 43 

actcctaagc ctcaagtgat ctccttattg gaacaaggga aagagocctg gatggttggc 60 
agagagctta caanaggcct gtgttcagat ctggaatcga tgtgtgaaac caagttatta 120 
tctctaaaga aggaagttta tgaaatagaa ttatgccaga nggagataat gggacttaca 180 
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aagcatggcc ttg|Jfctc cagttttgga gatgttttgg aat^J^raag ccaccttgca 240 
aaacaactgg gatatccaaa tgggcatttt agtcaagana tattcactcc tgaatacatg 300 
cccacattta ttcaacagac attccttact ctccatcana taattnataa tgaagacaga 360 
cnctatgaat gtaaggaatg tggaaagatg tttagtcatg gctcacaact tactcaacat 420 
cagagaatac acactggtga gaaaccctat caatgtaagg aatgtggaaa agcgtttaat 480 
cgtggctcac tccttacacg acaccagagg attcatactg gtgagaaacc ctatgaatgt 540 
aaagaatgtg gaaaaacctt tagtcgtggc tcagaactta ctcaacatga gcgaattcac 600 
acagctggag ctccgttact ctcatgggga ctttgagaat ggtatctggg tgttatggat 660 
catggagcag aaatacagaa ggagtctgca accctgatga ctttgtggaa cagccatact 720 
agctctggtc tgcttgcc 7.38 

<210> 44 

<211> 310 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG-.1076931 . 1 :2000FEB18 
<400> 44 

gggcggtgaa ggtcagagtg cagacctgag accactgctc accgacttca gactccagtt 60 
tatctgtgcc ccagcttcct cacttagtct cagaagactt aggctgaggc ctcagaagga 120 
gatctccatc ctttgtccag agcagaacag gatggccatg tcccaggaat cattgacctt 180 
caaggacgtg tttgtgggct tcaccctgga ggagtggcag caactggacc ctagtcagag 240 
ggctctgtat agagatgtga tgctggagaa ctacagcaac cttgtctcag tggggtattg 300 
tgctcacaaa 310 

<210> 45 

<211> 404 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 1078121 . 1 : 2000FEB18 
<400> 45 

tgctcgggga agctcaggtg gccgccacag cctcttgccc tctgacctgc aggtatcagg 60 
agatccatag ctaagcagcc tggacacccg ggaagctggg aaatggtgtg atttgtccag 120 
ataatctcac ctgagaagga atcccagaga agaaggaaga agaggaagaa atggctggtt 180 
ctcagggact gttgatattc agggatgtgg ccatagaatt ctctccggag gagtggagct 240 
atctggaccc tgctcagcag aatctgtata gggacgtgat gttagaaaac tacagaaacc 300 
tggtctccct gggtattgct gtctctaagc cagaactgat cacctgtcta gagcaaagga 360 
atgagccctg gaatgtgaag aaacattgag acagtagcca gaca 404 

<210> 46 

<211> 1179 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 1079203 .1 : 2000FEB18 
<400> 46 

gggagctcta ggtctagtct ttactgctct gtgtattctg ctcctagagg cccagcctct 60 
gtgactccgt tatctggcag gtattgggag atgcacagct aagatgccag gaccacctgg 120 
aagcctagaa atgttatata ttctcatttc actgaagacc tttggccaga gcatagcata 180 
aaagattctt ttcaaaaagt gatactgaga ggatatggaa aatgtggaca tgagaattta 240 
caattaagaa taagttgtaa aagtgtggat gagtctaagg tgttcaaaga aggttataat 300 
gaacttaacc aatgtttgag aactacccag agcaaaatat ttcaatgtga taaatacgtg 360 
aaagtctttc ataaattttc aaattcaaac agtcataaga aaagaaatac tggaaagaag 420 
gttttcaaat gtaaagaatg tggcaaatca ttttgcatgc tttcacatct aacacaacat 480 
ataagaattc acactagaga gaattcttac aaatgtaaag aatgtggcaa agttcttaac 540 
caatcctcag agcttattaa acataagaaa attcatactg gagagaaacc ctacacgtgt 600 
gaaaaatgtg gcaaaacctt taaccagtct gcaaatcttt atgcacataa gaaaattcat 660 
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actggagata 
cattcttaaa 
cggtctttaa 
tctaaaatgt 
gagtaattat 
ctagtcagtt 
caaaacttgt 
ttgtacaaac 
gcaaataaat 



a^^c 



aaaccataca^gtgtaaaga 
cataagagaa ctcataccag 
atgctcctca atcctttcta 
gaaaaatgtg agaaggcttt 
attgatgaga agccaaatta 
cttaatgtaa ttcctactga 
aaccaatgct tagcattttt 
ataaaaatat acaaaagcta 
gtatttctgt tgaaagacct 



atgtggcaaa 
gggaatgtct 
atcataagat 
^aatcacacc 
aaattaaaaa 
agaaaacacc 
gcacatgata 
ttgtctactt 
ttagaaata 




PCT/US01/06059 



gcctttaagt^Sfftactacag 720 

tacgaaggtg acgaatgtcg 780 
aattcataat gaagagaaac 840 
tcaatctgtt gtagacataa 900 
tgtggcaaag acttccaatt 960 
tggaaataca aaaaatgtgg 1020 
gtatttatat gtgagactac 1080 
gaaatttata gttaataaaa 1140 

1179 



<210> 47 

<211> 557 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: LG: 1082586 . 1 : 2000FEB18 



<220> 

<221> unsure 
<222> 42 
<223> a, t, c, 



g f or other 



<400> 47 

gaaaggaaaa 

gcagaccaat 

ttggaagaaa 

caagaagctc 

ataatcaaac 

agtcttgtgt 

gacctgcagg 

tgagatcatt 

tggacactgc 

tcttcctggg 



agattgaaga 
atatacatcg 
taatggctga 
aacaaactcc 
tgtcaaaaga 
ttacttctct 
tattgggaga 
gacatttagg 
acagcagaat 
tatattg 



aaagtaagca 
taggagttcc 
aaacttttca 
aagtatgatg 
caaacacagg 
gtttcttctg 
tccactgcta 
gaggtggcca 
ttatatagga 



cagcctagga 
agcagaagag 
agtttgataa 
aactcaaaga 
cctttgtctc 
cttctagagg 
agatgccagg 
tagaattatc 
atgtgatgtt 



gncccatggg 
agaaaggggc 
aagacatgaa 
tagccacatt 
tcactgcagc 
cccagcctct 
accccctgga 
tctggaggaa 
agggaactac 



acaatatcaa 60 
agagagaata 120 
t ataaacatc 180 
gagacatatt 240 
cagagctccc 3 00 
gtggccctgt 3 60 
agcctagaaa 420 
t ggcaa t gc c 480 
agaaacctgg 540 
557 



<210> 48 

<211> 353 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 
<223> Incyte ID No: 



LG: 1082774 . 1 :2000FEB18 



<400> 48 

cagagatccg 

ctgtggccct 

gaagcctaga 

agtggcattg 

acagaaacct 

agcaaggaaa 



agctatcgat 
gtgacctgca 
aatgggacca 
cctggacgct 
gatcttcctt 
aaaaccttta 



cttaactgct 
ggtattagga 
ttgcaattta 
gcacagcgga 
ggtattgttg 
actatgaaga 



ctgtgccgtc 
gggtcacagc 
gagatgtggc 
atctatatag 
tctctaaacc 
ggcatgagat 



agctcctaga 
taggacgccg 
catagaattc 
ggatgtgatg 
aaacctgatc 
gattgccaaa 



ggcccagcct 60 
ggaccccctg 120 
tctctggagg 180 
ttagagaact 240 
acctgtctgg 300 
ccc 353 



<210> 49 

<211> 1027 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> Incyte ID No: 



LG: 1082775 . 1 : 2000FEB18 



<400> 49 

ctctgctctg ccaacactag tggaatatga tcacatccca gggatcagtg tcatttaggg 60 

atgtgactat gggcttcact caagaggagt ggcatcatct ggaccctgct cagaggaccc 120 

tgtagaggaa tgtgatgctg gagaactaca gccaecttgt ctcagtaggg tattgcattc 180 

ctaaaccaga agtgatcctc aaattggaga caggcaagga gccatggata ttagaggaaa 240 
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aatttcgaag ccadj^/fcat 

aataccagta gaaactattc 
ttctgtgatg aaagcatgaa 
atgggaacag cttctggctg 
agcaaccttt tgaatacaat 
tacataagag agcttacaca 
gttatatgtc attttttatt 
gtaatgaatg- tggagaaagt 
acccattcag ccagaattta 
ttattgaata taatgaatgt 
agagaacaca tccaggggaa 
agaagtcagc ccacacaaga 
ggatgtggga agaccttgta 
gggagtt 

<210> 50 
<211> 364 
<212> DNA 

<213> Homo sapiens 



ctgggtgagt 
aagaatgaag 
ataattcatt 
aatgaagacc 
gaatgtggga 
ggacagaaaa 
actcatcagc 
atctttgagg 
aatcccactc 
ggaacctttt 
aaaccttatg 
catcagagaa 
taagaattca 



tagtgccaga 
ttcaatgagt 
ttgaagagga 
tcatttggca 
aagctttccc 
catgcaaata 
aaacacatcc 
aatccattct 
taattcagag 
tcagtgaaaa 
aatgtcatga 
cacacacagg 
gacctcatta 



tggl^^taa 

ttaacaaagg 
accttctgaa 
tcagaagatt 
tgagaattca 
tactgaacat 
aagagagaac 
ctttgaacat 
aacccactca 
attagccctt 
atgtggaaaa 
gaaaacccta 
gaaatcaaag 



agaattaatt 
tggaaaatgt 
tataataaca 
aaaaattggg 
ctcttccttg 
gggaaaacct 
cactatgaat 
cagaatgttt 
attagcaata 
catttacaac 
accttcaccc 
tgaatgtcac 
aattcacaca 



<220> 

<221> misc_feature 
<223> Incyte ID No: 



LG: 1083120. J.: 2000FEB18 



300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1027 



<400> 50 

cgccaggcct 

agaagaggaa 

cattcaggga 

agaaagcttt 

aggataattt 

gtctttctgt 

aaat 



acgagatttg 
gaagaggaaa 
tgtagccata 
gtactgggat 
tgctctagaa 
gggagcccgt 



acttgtaaag 
gaaaaggagt 
gaattctctc 
gtgatgttgg 
gttaagatct 
tccattcatt 



aat tact tat 
cagggatggc 
aggaggagtg 
agaactacag 
gccctcgtgt 
tgcactgaga 



attgactgag 
tcttacacag 
gaaatccctg 
gaacctggtc 
atttttgtat 
ctgaagcctt 



agagaagccc 60 
ggacctttga 120 
gaccctgtgc 180 
ttcctgggta 240 
tttctctgtt 300 
gttgagacac 360 
364 



<210> 51 

<211> 901 

<212> DNA 

<213> Homo sapiens 

<220> ' ' 

<221> misc_feature 
<223> Incyte ID No: 



LG:1087707 .1:2000FEB18 



<400> 51 

cctagcttct 

tagctcctcg 

tccataggga 

atgtggctgt 

tgtataggga 

ttaagccaga 

agaagacagt 

attggagcta 

tctgttgaac 

acatggattt 

tagagaaagt 

attagaatga 

gagattttga 

tattgctcct 

gcctgttgct 

c 



aggctctgag 
gtgactccac 
agaaggcgga 
agaattctcc 
tgtgatg.tta 
cttgattacc 
agccaaacac 
tatgtttctg 
atatgacttc 
aaatatatag 
aatcatattt 
acagataatt 
aagcaaatct 
ggcttagaat 
caccatagac 



tccagtaccc 
catagcccct 
acatccggag 
ccagaagagt 
gagaactacg 
tatctggagc 
ccagattatt 
aattctgaac 
tctcgtctgc 
atatatcact 
gtttatggtt 
ttacaagtgt 
attttctctg 
catcttgtgc 
ataattttct 



gtctgtacta 
gttatcctgt 
gctgggaaat 
gggaatgcct 
gaaacctgtt 
aaagaaaaga 
atgatgtttg 
aactatttat 
taaacacata 
gtaaaataaa 
gtgtacctac 
tgatcactta 
ctgtgtatta 
aaattctctt 
tttcataaaa 



ttccatctct 
gacctgcagg 
gcgagtgcta 
ggactctgct 
ctccttgggt 
gccttggaat 
taatgaagat 
aaaattttat 
cagaccttta 
cttcaggtgt 
tttgagaaga 
ccagcaaacc 
aattcattta 
tttttgttgt 
cattctttgt 



tccgctccat 60 
tactgggaca 120 
acattcaggg 180 
cagcagcgtt 240 
cttgctatct 300 
gcaaggagac 3 60 
tatgagtata 420 
cctacttttt 480 
gttttggttt 540 
aacagattta 600 
aaagaaaaat 660 
agaaacttca 720 
tctaaaatgt 780 
ttgtctgttt 840 
ataatcacct 900 
901 



<210> 52 

<211> 254 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc feature 
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PCT/US01/06059 



<400> 52 

ttccgttttc gcgtggttct tttgcaagct ctggattctc tggagtttga atgtttccag 60 
tattggaacc ccaccaagta ggactgatca ggtcttacaa ttctaaaacc atgacctgtt 120 
ttcaggaatt agtgacattc agggatgtgg ccatagactt ctctcggcag gagtgggaat 180 
gcctggaccc taatcagagg gacttataca gggatgtgat gttggagaac tacaggaacc 240 
tggtctccct ggga 254 

<210> 53 

<211> 597 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc__feature 

<223> Incyte ID No: LG : 1094230 . 1 ; 



<400> 53 

tgcagtcaga 

cctgcaggta 

gaccattgca 

acactgcaca 

tccttggtat 

ctttgaccat 

acagtgagag 

ccataatttt 

agttctcttt 

tttgttcatt 



gctccaggtc 
ttgggagatc 
atttagagat 
gcggaattta 
tactgtttct 
gaagagaaat 
ccaaagtcct 
tgggacgcac 
ttgcatcaga 
tttctgcaca 



tggttcttct 
cacagctaag 
gtggccatag 
tataggaatg 
aagccagacc 
gagatgatag 
cttcatgaca 
aaatgtctgc 
tctgaaatgt 
gtccattctg 



2000FEB18 



cctaaaggcc 
acaccgggac 
aattctctct 
tgatgttaga 
tgatcacctg 
ccaaaccctc 
tataagagac 
atgattttga 
ctgagagtaa 
tttttattac 



caggctgtgt 
ctcctggaag 
ggaggagtgg 
gaactacagt 
tctggagcaa 
agtgagcttc 
tgcacagtgt 
gaaactaaaa 
tagtttctgt 
tatctaggct 



ggccccgtgt 60 
c caaaaat gg 120 
cattgcctgg 180 
aacctggtct 240 
ggaagaaaac 3 00 
cttcaagttc 3 60 
ggctgctttt 420 
ttactctcaa 480 
tgaatttttt 540 
tgaaata 597 



<210> 54 

<211> 637 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: LG: 474848 . 3 : 2000FEB18 



<400> 54 

ggaagtgcgg 

ccctgcgggt 

ggaccagtgt 

tgaagtgcag 

aaatctctgg 

caggacctgg 

agcctggtat 

ggagcagagc 

ctccttttgg 

tgaaggacga 

aaactttatc 



cacccagagg 
cggtgcgagg 
tccccagttg 
ctatgtttcc 
agttggtgtc 
atgacgctca 
cattggggca 
catggatagt 
tagaggcttt 
cacatccctc 
tcgtaagcta 



cggtcttgta 
gcgaagagga 
tgggagcaga 
agtgttctct 
ctttgaggag 
gaggaccctg 
ttgcattacc 
agaagaaacc 
tttataccat 
cagagatgat 
cctatagttt 



gctgggccag 
acccgtgggc 
cgcgtgggcg 
ggctgtttcc 
gtagctgtgc 
tacagggacg 
aaacctgaga 
ctaaacctga 
ttgggaactg 
aaaagacttt 
acaaaaa 



cctggggctt 
ccgggggatc 
catcgcgggc 
aagagctaca 
acttcacctg 
tgatgctgga 
tgatcttcaa 
gactttcagg 
gtttggataa 
ttgtcttttc 



ggttctatgt 60 
ccggggggcc 120 
gggcagggcc 180 
agaaaagaat 240 
ggaggagtgg 3 00 
gacctacagc 3 60 
gctagagcaa 420 
ctaagcctcc 480 
gctaccttaa 540 
tgatttaaga 600 
637 



<210> 55 

<211> 754 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 251*656 . 1 : 2000FEB01 



<220> 

<221> unsure 
<222> 702 
<223> a, t, c, 



g, or other 



<400> 55 



27/228 



BNSDOCID: <WO 0162927A2J_> 



WO 01/62927 _ PCT/US01/06059 



tgaaaatggg gag^BJgta atcaagacat gtttgagaat gaa gta agatattctc 60 
ggaaatgcct gaaggtgaaa gtgctcagca ctccgatggg gaaagtgact ttgagagaga 120 
tgctggcatc cagaggctcc agggacacac cccaggtgag gaccacgggg aggtggtttc 180 
tcaggacagg gaagttggcc agctcatagg cctgcagggc acctacctag gggagaagcc 240 
ctacgaatgt ccccagtgtg ggaagacctt cagccggaaa tcccacctca tcacacacga 300 
gaggacccac acaggagaga aatactacaa atgtgatgaa tgtggaaaaa gctttagtga 360 
tggttcaaat tttagtagac .accaaaccac tcacaccggg gagaagccct acaaatgcag 420 
agactgtggg. aagagcttta gccggagtgc caacctcata acccaccaga ggatccacac 480 
gggggaaaag cccttccagt gtgccgagtg tggcaagagc ttcagcagga gtcccaacct 540 
cattgcacat cagcgcaccc acacaggaga gaaaccctac tcgtgccccg agtgtggaaa 600 
gagctttggc aaccgatcca gccttaacac gcatcagggg atccacactg gagaaaagcc 660 
ctacgaatgt aaagaatgcg gcgaaagttt tagttacaac tncaatctaa tcagacaaca 720 
gagaattcac acaggagaga aaccctacaa atgt , ,754 

<210> 56 

<211> 1601 * 

<212> DNA 

<213> Homo sapiens 

<220> j 
<221> misc_feature 

<223> Incyte ID No: hi : 021371 . 1 :2000FEB01 
<400> 56 

acgtctgtcc tggtgtgtca gtggctggct tgtgacctgg catgtccgtg gcaggtgtgg 60 
aaggagagcc tctagtgagt tcccagagtg gacagagccc tccagagcca caggatcccg 120 
aggctcccag ctcctcaggc ccaggacacc tggtggccat gggcaaggtg agcaggaccc 180 
ctgtggaagc tggtgtgagc cagtcagatg cagagaacgc agccccctcc tgcccggatg 240 
agcatgacac tctgccccgg cgccgaggtc gaccttccag gcgcttccta ggcaagaaat 300 
accgcaagta ctattacaag tcgcccaaac cacttttgag gcccttcctg tgccgcatct 360 
gtggttctcg ctttctgtcc cacgaggacc tgcgcttcca cgtcaactcc catgaggctg 420 
gcgatcccca gctcttcaag tgcctgcagt gcagctatcg ttcccgccgc tggtcctcgc 480 
tcaaggagca catgttcaac cacgtgggca gcaagcccta caagtgtgac gagtgcagct 540 
acaccagtgt ctaccggaag gacgtcattc ggcacgccgc tgtgcacagc cgggaccgga 600 
agaagaggcc agatccgact ccaaagctga gctctttccc ctgccctgtg tgtggccgtg 660 
tgtaccccat gcagaaaaga ctcacgcagc acatgaagac gcacagcact gagaagcccc 720 
acatgtgtga caagtgtgga aagtccttta agaagcgcta caccttcaaa atgcacctgc 780 
tcacgcacat ccaggctgtt gccaaccgca ggttcaagtg tgagttctgt gagtttgttt 840 
gtgaagacaa gaag;gcactg ctgaaccacc agttgtccca cgtcagtgac aagcccttca 900 
aatgcagctt ttgtccctac cgcaccttcc gagaggactt cttgctgtcc catgtggctg 960 
tcaagcacac aggggccaag cccttcgcct gtgagtactg ccacttcagc acacggcaca 1020 
agaagaacct gcgcctgcac gtacggtgcc gacacgcaag cagcttcgag gaatggggga 1080 
ggcgccaccc tgaggagccc ccctcccgcc gtcgcccctt cttctctctg cagcagattg 1140 
aggagctgaa gcagcagcac agtgcggccc ctggaccacc tcccagttcc ccaggacctc 1200 
ctgagatacc cccagaggcg acaactttcc agtcatctga ggctccctca ttgctctgtt 1260 
ctgacaccct gggcggcgcc accatcatct accagcaagg agctgaggag tcgacagcga 1320 
tggccacgca gacagccttg gatcttctgc tgaacatgag tgctcagcgg ggaccggggg 1380 
gcacagccct gcaggtgtgc tgcctgggaa cctgctcccc ttcccagcta ccgcagtacc 1440 
cagccctgca ctggacactg ggactagagg aaaacagtgt atcagagctt ctccgccctt 1500 
gggggctccc aggcagtgga ggagacagat ctgctgaagt ctggtgggca aacagagagg 1560 
aacaggcttt gccacggagg ccgcagggta tcccatccat t 1601 

<210> 57 

<211> 1059 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 133095 .1 :2000FEB01 
<220> 

<221> unsure 
<222> 107 

<223> a, t, c, g, or other 
<400> 57 



28/228 



BNSDOC1D: <WO 0162927A2_I_> 



WO 01/62927 



tggactgggc gaggaggtgc '^tgtgccat 
agagcccgag ggtagccagc ctagcaccat 
caagaaagag aaggaggacc tgtgcttgct 
cccggagctg gcaacaacgg tgcctgagag 
ggagctggac ggcagcgaca , tgtcagccat 
gaggcggcgg, agcaagcggt cgcgggtgat 
ctgcccatac gagggctgca gccaagtcta 
caatcttgtg catcggaaag gaaagaccaa 
gttctattta tccaaccacc tgcggcggca 
cacctgcgag acctgcggca agtccttcaa 
cacccacacc ggcgagaccc ccctgcagtg 
cgcgtcgctc aactggcaca tgaagaagca 
cgatcgctgc gggaagcgct tcgagaagct 
ccacccggat cacaagccca cctgacccac 
cgctcggaca ccacgcccgg gcttgccggg 
gcgggccgga aggaggcgcc cccgccccgc 
caccccgccc caccgcatcc ttctcggagc 
gtcaagagag cagagtgaga ttaaagagcg 

<210> 58 

<211> 1187 

<212> DNA 

<213> Homo sapiens 



gatggagggt 
ggacgccacc 
aaagaaggag 
cgcagagcct 
catctatgaa 
ggatgctgac 
cgtggccctc 
agtgtgccct 
catgatcatc 
gaggaagaac 
cgagatctgt 
cactgcggag 
ggacagcgtc 
ctgaccactg 
gcctggacag 
cccagagctg 
tggtgcctgg 
agaaaggag 



t^Wac 



PCT/US01/06059 



gtggcagcct W&cccagac 
gcagtancag gcatcgagac 
gagaaggagg agccagtagc 
gaggcagagg cggacgggga 
atccccaagg agcctgagaa 
ggcctgctcg agatgttcca 



agcagcttcc 
catcctggct 
cattcaggtg 
cacctggagg 
ggctaccagt 
gtgcagtaca 
aagttccaca 
accgccccta 
ctgcgagggc 



agaaccacgt 
gtggcaagaa 
tccgtgaatt 
tacatcggcg 
gccggcagcg 
acttcacgtg 
cgctcaaaag 
tttattcgtc 
cgcccggacc 



ggccccctgg gcaggttccc 
ggctgcattg ctggaactgt 



<220> 

<221> misc_feature 
<223> Incyte ID No: 



IjI:23 6654.2:2000FEB01 



<400> 58 

gcggccgctg 

agctgggctg 

tgaaggcctt 

gtcacgccca 

ccccgtcctg 

tgaggccaag 

gtgtgcaaga 

ggcgagcggc 

aaccgccaca 

agccaggagc 

agcacccttc 

gaacccgggg 

atcaccacgg 

cccaagtcag 

tttaccaaca 

acctgtgagt 

atgcacggca 

ggcctgcgag 

gcctgagccc 

tcctcgtttt 



cctgcagacc 
tcagctcttc 
gagctgcctg 
gtgggaccac 
ggcctggccg 
agcccccgtg 
agaccctcag 
cctatgcttg 
agaagaccca 
aggcctctgc 
catgcagcgg 
ctcctggcag 
aacaaagaac 
ggggcaagag 
gcagcaacct 
tctgcaacta 
tgacgcctgg 
ccaccctgga 
aggaaagccc 
tgcccgcctt 



ttcccgttgg 
agaggcccca 
cgctgtggca 
ggactgtcca 
aggtggctgc 
caagtggcag 
ctccttcagc 
cgaccagtgt 
ccggcaggtg 
agcccctccg 
tggtgagggg 
tggggctcaa 
tgaccctgca 
ccgcgggccc 
gacggtgcac 
cgcctgcgcc 
cagcacccgc 
caaacacctg 
ccctcactgt 
ctccaagtaa 



aggccatcac 
gccgcggcca 
aacagttcac 
tctaccagac 
agccgtgtcg 
cggcctcacc 
aacctcaaag 
ccctacgcct 
ccgccccaga 
gagccggctg 
gctggagccg 
gccggccctg 
aacagccaga 

gggggcagct 
cggcgctcac 
cagagcagta 
ttcgagtgcc 
cggcagaagc 
ccctggtacc 
attttccctt 



tgccttcatg 
gggctcagaa 
agtggcctgg 
agaatcagag 
gcagtggtgg 
cggcggagcc 
tgcacatgcg 
gcgcccagag 
gccccctcat 
tccatgctgc 
ccgccacagc 
gtggagacac 
aggcatcacc 
gtgagttctg 
acaccgggga 
agctcaaccg 
cccactgcca 
accctgaggc 
gctgccaaca 
ttattta 



gaccacaaga 
cgagaggagc 
aagctgctgc 
gccccggagg 
ggccagcagc 
cacctgtcct 
ctcacacaca 
cagcaagctc 
ggccgacacc 
tgcccccacc 
aggtgtcoag 
ttggggagcc 
caaaaagatg 
cgggaagcat 
gcgccoctac 
ccaccgccgc 
tgtgcccttc 
ggccggcgag 
cccattgacc 



<210> 59 

<211> 1671 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No : LI : 200009 . 1 : 2000FEB01 



<220> 

<221> unsure 
<222> 1663 
<223> a, t, c, 



g, or other 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1059 



60 

120 

180 

240 • 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1187 



<400> 59 

gcggccgccg cgcccccgcc ggagctgccg gagtggctgc gggacctgcc tcgcgaggtg 60 
tgcc-tctgca ccagtactgt gcccggcctg gcctacggca tctgcgcggc gcagaggatc 120 



29/228 



BNSDOCID: <WQ 0162927A2 I > 



WO 01/62927 



PCI7US01/06059 



cagcaaggca cct^jprtgg 
gcaggcgccg tgaggaacac 
aagccatgtt gccgaccaga 
ctaccaccca gcagcgctcc 
ttctggataa gtctgggccc 
gagtcctggc aagcccaact 
tttggaaatg tgggcagtgc 
tgtgcacgca gaaccccgac 
agccttcaga actgaggaac 
gctactgtgg tcgtgccttt 
ctggagaaaa gcccttcaag 
tgagccgaca ccagcggatg 
tcgaagtgga ttaacggatt 
gtcacggaca cttaagcaaa 
aaaccaaaag cagtaataaa 
tcagaccagg aaagagatta 
actaacctgg gatggttaac 
gctttttaaa aactgttatt 
cgttgtgact aagaatgcac 
ttcatgggga cccactgtac 
ctggctgccc cagcggggac 
cttcagcccc agacgcatcc 
acacggtttc ttccaaacag 
ggtttcagtt tcaaaattca 
caagaacatg taaaatgtac 
tgagaaacat ttgttttcaa 

<210> 60 

<211> 952 

<212> DNA 

<213> Homo sapiens 



acctttccaa 
gcagcatctc 
agcccttgag 
gttgttttcc 
attgaatcag 
tccacaagcc 
tttaagactt 
agaccctacc 
cacgtggtca 
gccggggcca 
tgcgagaggt 
cccaatgagt 
gatggttgga 
accaaagatt 
ataagtaaga 
tttatttatg 
atttccagtc 
taataccaaa 
agggacttgg 
agcccttcat 
cacggaagca 
catttccatg 
cccggtcttg 
aaggaaaaag 
ggagcttcat 
taacatttta 



ggcgtgcttc 
tgggagtaaa 
gcctttcacc 
cccagactcc 
gatttaatca 
agctccactc 
tcacccagcg 
aatgcggcca 
ctcactctag 
ccaccctcaa 
gtgagaggag 
gcaagccaat 
attaaactgc 
tcctctgagc 
tgttaagaga 
acttagggat 
ccaccatgta 
gggaggaatc 
ttctcgttgc 
tctgctgtgt 
gagtgagagc 
tcttccatgc 
atgcaggaga 
ttgaggactt 
aatacgttat 
gcttaaaaaa 



tgcl^^aga 

tgtcccttca 
aaaggcggca 
gtgcagcagg 
aatcaacgtg 
ggagttcagt 
gatcctctta 
ctgctcccag 
tgaccggcct 
caaccacatc 
cttcacgcag 
aactgagagc 
aaggaaagtc 
aactttcaat 
tattgatcct 
gagacttatt 
ttttgctttg 
gtatgggttc 
accttttttt 
cagtttggcc 
cttcgctgag 
tcactgctca 
gtctggaaaa 
attttgtcct 
attgttccga 
aanaaaggaa 



gaaggtgcag 
acggtaatgg 
aatctcgccc 
aacttctctc 
aaaaaccagc 
gactggcatc 
cagatgcacg. 
tccttttccc 
ttcaagtgcg 
cgaacccaca 
gccacccagc 
ccagaatcaa 
atgattaaat 
cagtcccaga 
ggcatggaag 
tcagtggaca 
tttctaaaaa 
ttctgcccac 
agtaacatgt 
tggcctgaca 
tcaatgctac 
tgcacttttt 
ggaagaaaat 
gtcaagattg 
agcagctcgt 



180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1671 



<220> 

<221> misc_feature 
<223> Incyte ID No: 



LI: 758502. 1:2000FEB01 



<400> 60 

caacactcat 

gtggaaagac 

agaagcccta 

ctcaccagag 

ttagccagag 

aatgcccgga 

ctcattcagg 

cacatttggt 

gtgggaaagg 

aaagacccta 

cccatcagcg 

tcaatcagcg 

actgtagtaa 

cccatttgat 

tactgtcttc 

tgatctattc 



caaacatcag 
ttttgggcgg 
cgcgtgcctg 
gacccacaca 
ctcggatttg 
gtgcgggaag 
agagaggcct 
cacgcaccaa 
attcgccgac 
caaatgtgga 
aatccactta 
ttcccatttc 
atgtaacaag 
ttaggaagta 
aagcacccca 
tccctctttc 



agaacccaca 
aagccacacc 
gaatgtcaca 
ggggtgaagc 
attaagcacc 
ggcttcagag 
ttcagttgtc 
agaacacaca 
agctccgccc 
gagtgtggga 
ggagacaggc 
ctcacacacc 
agcttccgtc 
gtctttggtg 
aatagagaaa 
ttgtctatgt 



caggggagag 
tcataatgca 
aaagcttcag 
cttacaggtg 
aacgaaccca 
atagttctca 
ctgactgcca 
caggtgagag 
tcactaagca 
agagcttcaa 
cctatcgatg 
agagaacgca 
agaaagcgca 
ttcagctgct 
acctgggcgt 
tataacagag 



accctatgag 
ccaaagaacc 
tcgaagctca 
taatgactgt 
cacgggagaa 
ttttgtagct 
caaaagcttc 
accttttaag 
ccaacgaatc 
tcagagctcc 
tcctgagtgt 
tacaggagaa 
tcttttatgc 
cccttgcaca 
cagtggctca 
aggatagact 



tgcccagagt 60 
cacacaggcg 120 
aatttcatca 180 
ggggagagtt 240 
cggcccttca 3 00 
cacatgagca 360 
agtcagagct 420 
tgcgaaaact 480 
cacaccggag 540 
cactttatta 600 
ggcaagacct 660 
aaacctttcc 720 
catcaagaca 780 
ttttcattgc 840 
atttgggccc 900 
ta 952 



<210> 61 

<211> 749 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 344772 . 1 : 2000PEB01 



<220> 

<221> unsure 
<222> 713 
<223> a, t, c, 



g, or other 



30/228 



BNSDOCID: <WO 0162927A2J_> 



WO 01/62927 



<400> 61 

cgcaaaaggc 

agccctacct 

attcctagga 

gaagctgagc 

tgaaaaatga 

ctttaaaatg 

atctctggag 

ccagcactct 

cgatgaggag 

cgtgattgtg 

gcctcactcc 

catggtctgt 

ggcccttcca 



tgtccctccc 
gcactgctct 
tgacctggga 
tgtcttcaac 
gtcagacctt 
gacatagaag 
aaggagttcc 
tctcctagcc 
tcaagcagac 
gaagattcat 
cctgggggca 
attggaccca 
ttgcaaccag 



aaacctggga 
ggtctagtac 
atttatcccc 
tgtattatat 
tccactgtaa 
actgcaatgg 
tcggggcccc 
gctcactcag 
tgctggggcc 
tgtctgagcc 
tccggctgcc 
acgtgctcat 
tgtggtgcc 



ttctgggctc 
aaacaggctg 
^ctgcactgc 
gtagacagaa 
agacaaacac 
ccgctcctat 
agtggggccc 
tgccaactcc 
agatgagcgg 
cctgggctac 
caatggcaag 
ggtgcacaag 



actgagttca 
ctggcattga 
tccccaagca 
ggtcaccatc 
a t gage t gag 
gtgtctggta 
teggtgagea 
atcaaggtgg 
ctcctggaaa 
tgtgatggga 
ctcaagtgtg 
cgcagtcaca 



PCI7US01/06059 



cctgcgagtc 60 
gagactggee 120 
cttggataat 180 
tacagectgg, 240 
ctaagaatct 3 00 

geggggaetc 3 60 

cccccaacag 420 
agatgtacag 480 
aggacgacag 540 
gtgggccaga 600 
aegtctgegg 660 
ctnggtgaaa 720 
749 



<210> 62 

<211> 620 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_f eature 
<223> Incyte ID No: 



LI:78 9445.1:2000FEB01 



<400> 62 

ggaacatcga 

tcagcactta 

gttggatcaa 

cattatacta 

ggaagagctg 

caaaggctcc 

ggagagaagc 

tgaacaccca 

taatcagagt 

tgctgaggga 

tcttactagg 



gaagecaagg 
gtgacctttg 
geccagagag 
gcagggtctg 
aggacaggag 
ccactgctgc 
aatactgcag 
tttcttatga 
ggataagect 
aaaatgecta 
cagaataaaa 



ctagtggctg 
aggatgtggc 
atctctacag 
aattattcaa 
tgacaggagt 
aagatatttc 
agaaactgta 
ctcacatgat 
taagaaagaa 
agtgtgttaa 



ggttactgat 
tgtggacttt 
agatgtgatg 
aegtagtetc 
tetgeaggaa 
tgcagaaaga 
tgactctaac 
aactcacatt 
ctttcctcat 
acatgaaaaa 



ggattactga 
acccaggagg 
ttggagaact 
atgtctggat 
ttggatttgc 
tcaccaaatg 
cattctggaa 
ggagagaaaa 
agtttttaca 
gccttcaacc 



tggattcttc 60 
agtggacttt 120 
acaagaatct 180 
tggaacaaat 240 
aactcaaaac 3 00 
gagtacaatt 3 60 
aagtcttcaa 420 
cttctgagga 480 
agaaaagtca 540 
agtttccaaa 600 
620 



<210> 63 

<211> 1110 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: LI : 789657 . 1 : 2000FEB01 



<220> 

<221> unsure 
<222> 449, 530 
<223> a, t, c, 



g, or other 



<400> 63 

ttagcagtgt 

atagaattca 

gcaaatcaag 

gaatgtcacc 

gagagatagc 

gcacaacatc 

tctttagtgg 

acttatgtaa 

gaatccataa 

ttcccacctt 

atgtggcaag 

agagaaacct 

tgtacatcag 

cttcatccag 

agaatggtct 



aatgaacgtg 
tactggagat 
cctcgaaaga 
aagttttcag 
ataaaaatgt 
ctagaattca 
gcagtcaaca 
tgattgtcac 
tgaataaaga 
gcagttcatc 
tgcttcagtc 
tacatatgtg 
ataattcata 
aagtcccaac 
gaacacagaa 



gcaaaggttt 
aaacgttaca 
caggagaatt 
tcacactcaa 
aagagtttgt 
cattggagag 
cttgtttacc 
aaagtcttna 
tctaacaagt 
agagaattca 
aaagttcctc 
ctgaatgtgg 
ctggagagaa 
tcactgtaca 
aagccttcag 



taaatcaaaa 
aatgtgaagc 
catactggag 
accttgaaag 
gacaaggctt 
aaagcttaca 
atcaggcaat 
gttacactac 
gtaatgaatg 
tactggagag 
tettgeaagg 
aaaggccttt 
acctgatcga 
tcagagaatt 
ggtcagttca 



agcaaagctt 
atgtgacaaa 
agaaagctta 
acacagcaga 
tcaggcataa 
agtataatga 
ccatggtgta 
aaccattgeg 
tggcaagtcn 
aaaccttaca 
catcagacag 
tctcagaagt 
tgtactgtat 
catacactaa 
agecttaata 



gcacatcagc 60 
gtttacagtc 120 
catatgtgaa 180 
attcctactg 240 
ttcgcacctg 3O0 
atgtgacagg 360 
gggaaacttt 420 
aatcattgga 480 
ttagtaaaag 540 
aatgtaatcg 600 
ttcatacagg 660 
cagaccttgt 720 
gtgggaaggc 780 
tgaaatcata 840 
gatagtgcaa 900 
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iflPcgat tttggggact acatctttgt tgaWlfei 



caaccaatgg attflPtgat tttggggact acatctttgt tgalBfeatt ttacaagtga 960 

agtcatgttc ctaatgtatt tcattcttta tcaaagataa tagagaagtc aatacgtaaa 1020 

tgatggacat tttcactatg gcatataaaa gtttttaaat tgagaaatga atgattagca 1080 

taacagaacg aattgcatgt acatctcttt 1110 

<210> 64 

<211> 899 

<212> DNA ' 

<213> Homo sapiens 

<220> 

<221> misc__feature 

<223> Incyte ID No: LI : 789808 . 1 : 2000FEB01 

<400> 64 , 
gaaatcaaga attcacacag gggagagcct ttacagatgt catgaatgga gaaatccttc 60 
agtgaaaagt catcccttac tcaaaatcag agaacacacg tggggagaaa tcatgaatgt 120 
catgaatgtg ggaaaacctc gtttaagtca gttctaactg tgcatcagaa aacacacagg 180 
ggagaagccc tatgaatgct atgcatgtgg caacaccttt ctcagaaaat ccgacctcat 240 
taaacatcag agaattcaca caggagaaaa accttatgaa tgtaatgaat gtgggaagtc 3 00 
attctccgag aagtcaaccc ttactaaaca tctaagaaca cacagatggg aaatcttatg 360 
catgtattca atgtggaaaa tttttctgct gctactacag tttcacagaa catctgagaa 420 
gacacacagg ggagaaacct tttggatgta atgaatgtgg gaaaaccttc catcagaagt 480 
tggccctaat tgttcaccag agaactcata taagacagaa accctatgga tgtaatgaat 540 
gtggaaaatc attctgtgtg aagtcaaaac tcattgcaca £catagaaca tacacagggg 600 
agaaacccta tgaatgtaat gtttgtggaa aattattatt aagtcaaaac taactgtaca 660 
tcacagaaca cacttgaggt gaaaccctat aaatgtagta agtgagggaa attactctgg 720 
gtgaagtcag aactttgtag agcagagaac ataaagggtg agagaaatct gttaatataa 780 
tgataatgag aacacctttg ccctgaagtc agttctcaca gtagagaaga gaacttaaag 840 
agggaaaaaa caatatgaag atatggaatg caggaaaaca ttattctggg atttgggcc 899 

<210> 65 

<211> 476 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID fob: LI : 792919 . 1 : 2000FEB01 
<400> 65 

catcagatga ttcatatggg acagaaccca tataattgta aagaatgtgg gaagagcttc 60 
aaatggtcct catatctttt ggtccatcaa cgagtccaca ctggagaaaa gccatacaaa 120 
tgtgaggagt gtgggaaggg ctacattagt aagtcaggtc ttgacttcca ccatagaacc 180 
cacacgggag agagatctta taactgtgat aactgcggga agagctttag acatgcttct 240 
agtattttga atcataagaa actccactgc caaagaaagc cattgaaatg tgaagactgt 300 
ggaaagaggc ttgtatgccg gtcatactgt aaagaccaac aaagagacca cagtggagaa 3 60 
aacccatcca aatgtgagga ctgtgggaag cgctacaaga ggcgcttgaa tctggatata 420 
attttatcat tatttttaaa tgacatataa gttatacata tttatggagt gtgaaa 476 

<210> 66 

<211> 597 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI:793949 . 1:2000FEB01 
<400> 66 

tgcagtcaga gctccaggtc tggttcttct cctaaaggcc caggctgtgt ggccccgtgt 60 
cctgcaggta ttgggagatc cacagctaag acaccgggac ctcctggaag ccaaaaatgg 120 
gaccattgca atttagagat gtggccatag acttctccca ggaggagtgg cattgcctgg 180 
acactgctca aagagatttg tacagatgtg taatgttgga gaactacagc aacctggtct 240 
tccttggtat tactgtttct aaaccagatg tgatttcctc attggagcaa ggaagaaaac 300 
ctttgaccat gaagagaaat gagatgatag ccaaaccctc agtgagcttc cttcaagttc 3 60 
acagtgagag ccaaagtcct cttcatgaca tataagagac tgcacagtgt ggctgctttt 420 
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ccataatttt tgggacgcac aaatgtetgc atgattttga gaaactaaaa ttactctcaa 480 
agttctcttt ttgcatcaga tctgaaatgt ctgagagtaa tagtttctgt tgaatttttt 540 
tttgttcatt tttctgca!ca gtccattctg tttttattac tatctaggct tgaaata 597 

<210> 67 
, <211> 523 ■ 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc__feature 

<223> Incyte ID No: LI : 794389 . 1 :2000FEB01 
<220> 

<221> unsure 
<222> 74 

<223> a, t, c, g, or other 

i 

<400> 67 , , 

tgtaaggact gcaaaagacc ttttgtcgtg tgatgcagtt cactctgcac aggagaattc 60 
atactggtga aaanccctat gaatgcaagg aatgtggaaa gtccttcagc gcccattctt 120 
ctcttgttac tcataagaga acacacagtg gagaaaaacc gtataaatgc aaggaatgtg 180 
gaaaagcctt cagtgcgcac tcttcccttg ttactcataa gagaacacac agtggagaga 240 
aaccctatac atgccatgcc tgtgggaagg cctttaatac ttcctccaca ctttgtcaac 3 00 
ataatagaat tcatactggt gaaaaaccct ttcagtgcag tcaatgcggg aagtccttta 360 
gctgcagctc tcaccttact cgacactgta gaatgtgtaa tggaaaattt agcaaataac 420 
caaaactcaa aatgttaatt ctgattattc tgtatcatca ccagtacttt gtactttgag 480 
tctttttaaa tatgctctgt tagtttgtat ggaatgatat ttc 523 

<210> 68 

<211> 561 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feat^ure 

<223> Incyte ID No: LI : 79 6010 . 1 : 2000FEB01 
<220> 

<221> unsure 

<222> 92, 176, 216, 233 

<223> a, t, c, g, or other 

<400> 68 

ctcattagaa atcaaagaat tcacacgggg agagacctta cagatgtcat gaatggagaa 60 
atccttcagt gaaaagtcat cccttactca anatcagaga acacacgtgg ggagaaatca 120 
tgaatgtcat gaatgtggga aaacctcgtt taagtcagtt ctaactgtgc atcagnaaac 180 
acacagggga gaagccctat gaatgctatg catgtngcaa cacctttctc agnaaatccg 240 
acctcattaa acatcagaga attcacacag gagaaaaacc ttatgaatgt aatgaatgtg 3 00 
ggaagtcatt ctccgagaag tcaaccctta ctaaacatct aagaacacac agatgggaaa 3 60 
tcttatgcat gtattcaatg tggaaaattt ttctgctgct actacagttt cacag/aacat 420 
ctgagaagac acacagggga gaaacctttt ggatgtaatg aatgtgggaa aacctftccat 480 
cagaagttgg ccctaattgt tcaccagaga actcatataa gacagaaacc ctatggatgt 540 
aatgaatgtg gaaaatcatt c 561 

<210> 69 

<211> 543 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 796324 . 1 :2000FEB01 
<220> 

<221> unsure 
<222> 51, 54, 458 
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<223> a, t, c, ?iK>r other 



<400> 69 

tctataagaa ttcagccctc attagaaatc aaagattcac acaggggaga nacnttacag 60 
atgtcatgaa tggagaaatc cttcagtgaa aagtcatccc ttactcaaaa tcagagaaca 120 
cacgtgggga gaaatcatga atgtcatgaa ,tgtgggaaaa cctcgtttaa gtcagttcta 180 
actgtgcatc agaaaacaca caggggagaa gccctatgaa tgctatgcat gtggcaacac 240 
ctttctcaga aaatccgacc tcattaaaca tcagagaatt cacacaggag aaaaacctta 300 
tgaatgtaat gaatgtggga agtcattctc cgagaagtca acccttacta aacatctaag 360 
aacacacaga tgggaaatct tatgcatgta ttcaatgtgg aaaatttttc tgctgctact 420 
acagtttcac agaacatctg agaagacaca cagggganaa accttttgga tgtaatgaat 480 
gtgggaaaac cttccatcag aagttggccc taattgttca ccagagaact catataagac 540 
aga 543 

<210> 70 

<211> 567 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 79 6373 . 1 :2000FEB01 
<400> 70 

cactggagag aaaccctatg tatgcaatga atgtgggaaa gcttcagcca gaagacatgt 60 
ttaatatccc atcagagatt tcacacagga aagacaccct ttgtatgtac tgagtgtgga 120 
aaatcctgct cacacaagtc aggtctcatt aaccaccaga gaattcacac aggagagaaa 180 
ccctatacat gcagtgactg tgggaaagct ttcagagata aatcatgtct caacagacat 240 
cggagaactc atacagggga gagaccgtat ggatgctctg attgtgggaa agctttctcc 300 
cacttgtcat gccttgttta tcataaggga atgctgcatg caagagagaa atgtgtaggt 360 
tcagtcaaat tggaaaatcc ttgctcagag agtcatagct tatcacatac acgtgatctc 420 
atacaggata aagactctgt taacatggtg actctgcaga tgccttctgt' ggcagctcag 480 
acctcattaa ctaacagtgc gttccaagca gagagcaaag tagccattgt gagccagcct 540 
gttgccagaa gttcagtctc agcagat 567 

<210> 71 

<211> 633 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 79 6415 . 1 : 2000FEB01 
<400> 71 

ctctgctctg ccaacactag tggaatatga tcacatccca gggatcagtg tcatttaggg 60 
atgtgactat gggcttcact caagaggagt ggcatcatct ggaccctgct cagaggaccc 120 
tgtagaggaa tgtgatgctg gagaactaca gccaccttgt ctcagtaggg tattgcattc 180 
ctaaaccaga agtgatcctc aaattggaga caggcaagga gccatggata ttagaggaaa 240 
aatttcgaag ccagagtcat ctgggtgagt tagtgccaga tggaatttaa agaattaatt 300 
aataccagta gaaactattc aagaatgaag ttcaatgagt ttaacaaagg tggaaaatgt 360 
ttctgtgatg aaagcatgaa ataattcatt ttgaagagga accttctgaa tataataaca 420 
atgggaacag cttctggctg aatgaagacc tcatttggca tcagaagatt aaaaattggg 480 
agcaaccttt tgaatacaat gaatgtggga aagctttccc tgagaattca ctcttccttg 540 
tacataagag agcttacaca ggacagaaaa catgcaaata tactgaacat gggaaaacct 600 
gttatatgtc attttttatt actcatcagc aaa 633 

<210> 72 

<211> 652 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 79 8636 . 1 :2000FEB01 
<220> 
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<221> unsure 
<222> 76 
<223> a, t, c, 



g, or other 



<400> 72 

taatgaatgt, 

tactggagaa 

tctcattcaa 

caaaacattc 

accatatcga 

tcagaggatt 

acaaagctca 

taatgcatgt 

cattgaagag 

tcagagaata 

gttctcatca 



ggtaaagcct 
aaaccntgta 
catcagagaa 
aggtgtaact 
tgtttagaat 
catacaggag 
tcacttattg 
gggaaactct 
gactccttaa 
catgcttgag 
gatactaagt 



taagctccca 
aatgtaaagt 
tgcatactgg 
catcgcttag 
gtgggatgtc 
aaaaaccctt 
cacatcaaag 
ttagccagag 
aagccgattt 
agtgatttat 
tttaagaata 



ctcaacactt 
atgtggaaaa 
agaaagaccc 
taatcaccag 
ttttggccaa 
taaatgtaat 
aattcatact 
gtcatccctt 
gca:tgtgtga 
taaatataat 
aactttagct 



attattcatg 
gccttcagac 
tataagtgta 
agaattcata 
agtgcagctc 
acafgtggaa 
ggagagaaac 
actaatcatt 
aagccttaaa 
gaatattgag 
atgtaataac 



agcgaattca 60 
agagttccgc 120 
acgaatgtga 180 
ctggagagaa 240 
ttatacaaca 300 
aaacttttag 360 
cctatgaatg 420 
ataaaattca 480 
ccaaaactca 540 
aaaactctta 600 
tt 652 



<210> 73 

<211> 860 

<212> DNA 

<213> Homo sapiens 

<220> ( 
<221> misc_ f eature 
<223> Incyte ID No: 



LI:800045.1:2000FEB01 



<220> 

<221> unsure 

<222> 311, 500, 509, 541-543 
<223> a, t, c, g, or other 



<400> 73 

aaa^ttatgc 

agcagctcga 

gaggaatgtg 

tctggggaga 

ctggaccagc 

aaggccttca 

ccctatgaat 

aaaaggatcc 

ataccctgga 

nnntgacaca 

cagaagtgat 

gaagctaaaa 

tcctgttaaa 

gtatatgttg 

gtcatgaaag 



acactgggga 
gccttcgggt 
ggaaagccta 
agaactgtaa 
ataaaaggat 
ngaacagctc 
gcgacatctg 
acacagtttc 
agtggtattn 
ttcagtaact 
tcagggattc 
agctaaagta, 
tgtttgattg 
tctgttgtaa 
ccagctcctt 



aaagcgctat 
ccacaaacgg 
catgtcctac 
atgtgatgaa 
ccacactggg 
tgggctcaga 
tgggaaaacc 
agatgaactc 
acacattang 
aatggaacac 
ctatgaatag 
ccagtcatct 
tttttggaat 
aatgttaacg 



gaatgtgatg 
atccacactg 
tccagcctta 
tgtggaaaat 
gagaagccct 
gtccacaaaa 
ttcagtaaca 
ccataatgaa 
ctacaataaa 
accgtcaaca 
aaatgctgag 
agagagaagg 
gtgttattgt 
aatactttgt 



actgtggagg 
gggagaagcc 
taaaccacaa 
ccttcaatta 
atgaatgtgg 
ggatccacac 
gctctggcct 
tgatgaattt 
aggttctacc 
tgaattcgca 
aaggaacgca 
aaattaatgt 
aaagatgtca 
tcagggctca 



gactttccgg 60 
gtacaagtgt 120 
aagcac ccat 180 
tagctctgtt 240 
tgagtgtggg 3 00 
gggggagaag 3 60 
cacagtgcat 420 
gtgatgaggg 480 
gtggagagga 540 
ccttacatga 600 
ttttattgca 660 
ttcttaataa 720 
tgcaggacat 780 
ctctctcttt 840 
860 



<210> 74 

<211> 501 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 
<223> Incyte ID No: 



LI: 800680. 1:2000,FEB01 



<400> 74 

gcgctctcgg 

tccacttctt 

gaaagaagag 

agaattctct 

cgtgatgttg 

agatggttct 

aaaatccgac 

tactctatcc 

tgcctctgaa 



cccacacaat 
cttaatgaat 
gagggaatgg 
caagaggagt 
gagaactaca 
caccttgcag 
ccttttattt 
gggaacttag 
agttgctttt 



atgacctcgg 
gactgactta 
ctctttctca 
gggagtgcct 
ggaacctg-ct 
cctgtggaca 
tactcccgta 
ggagacctta 

g 



ggaggatgcg 
cc tgagaaag 
gggactgttt 
ggaccctgcc 
ttctctcgat 
gagcacactg 
ccttttattt 
gtcttcctga 



aggaagatga 
aaactcagag 
acattcaagg 
cagagggcct 
gaggataaca 
cctcttcctt 
ctctcccttt 
ttccatattc 



actgtgatga 60 
gaagaggaaa 120 
atgtggccat 180 
tgtacaggga 240 
tccctocaga 300 
agaatcctac 360 
tctagctttc 420 
ttctaagatt 480 
501 
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<210> 75 

<211> 703 

<212> DNA 

<213> Homo sapiens 

<220> i 
<221> misc_feature 

<223> Incyte ID No: LI : 800894 . 1 :2000FEB01 
<400> 75 

gtctcgtcct tactgctgtg tgtcctctgc tcctagagtc ccagcctctg tggccctatg 60 
acctgcaggt attgggagat ccacaactaa gtcgccagga ccccctggaa gcctagaaat 120 
gggatcattg acatttaggg atgtggccat agaattctct ctggaggagt ggcaatgcct 180 
ggacactgca cagcagaatt tatatagaaa tgtgatgtta gagaactaca gaaacctggt 240 
cttcctgggt attgctgcct ttaagccaga cctgatcatt tttctggagg aaggaaaaga 300 
gtcctggaat atgaagagac atgagatggt ggaagaatcc ccagttatat gttctcattt 360 
tgctcaagat ctttggccag agcagggcat agaagattct ttccaaaaag tgatattgag 420 
aagatacaag attcatcatc atgcctgtga gctgggtcca ataatgaatc actatcctac 480 
ctgtggccaa atgcatatat gacagttaca actctaactg cagactgcat cggcatgtaa 540 
aatttaggtc accagtgggc tctctccatg tttgaatgtg gtgattctaa tggtcagcga 600 
ggtgtgcata ggagagtcac aatctcacct tcaggtgggc cttgtatcag cactctctct 660 
actaccttaa gactgtatga cacatgggtg agctaaaaat gtt 703 

<210> 76 
< <211> 256 
<212> DNA 
<213> Homo sapiens 

<220> 

i <221> misc_feature 

<223> Incyte ID No: LI : 801015 . 1 : 2000FEB01 

<400> 76 

ggaagccgga aaatggactc agtggccttt gaggatgtgg cagtgaactt cacccaggag 60 

gagtgggctt tgctggatcc ttggcagaaa aaactctaca gagatgtgat gctggaaacc 120 

tataggaacc tggcttcagt aggtgatgac gacaacattc cttcacttag agaacaiagtt 180 

gcccatcaac gatatttcaa gacctggcat gtggaaaggg aatacttcag taaataaacc 240 
aagcatggtg acagct 256 

<210> 77 

<211> 458 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 801236 . 1 :2000FEB01 

<220> • 
<221> unsure 

<222> 11, 25, 58, 61, 91, 428 
<223> a, t, c, g, or other 

<400> 77 

aaagtttttt ngcccacagt acatnattaa tccgacatca gagaacccac acaggagngg 60 

naaccatatg aatgtaatga atgtgggaaa ntttcagcca taccccagcc ttcattcaac 120 
atcaaagaat tcatactgga gaaaaaccct atgagtgtaa tgcatgtgga aaggccttca 180 

atcggagtgc acatcttact gaacaccaga gaactcatac tggagagaag ccctatgttt 240 

gtaaagaatg tggaaaaacc ttcagtcgaa gtacacacct tactgaacat ctaaaaattc 300 

attcttgtgt gaaaccctat caatgtaatg aatgtcagaa actgttttgc tatagaacat 360 

cactaattcg acatcagagg acgcatacag gagagaaacc ctaccagtgt aatgaatgtg 420 

ggaaatcntt cagcttaagc tcagctctaa ctaaacat 458 

<210> 78 

<211> 604 

<212> DNA 

<213> Homo sapiens 
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<220> 

<221> misc_feature 
<223> Incyte ID No: 



LI: 803335. 1:2000FEB01 



<400> 78 

gggccagcct 

cgtgggcccg 

gtgggcgcat 

tgtttccaag 

gctgtgcact 

agggacgtga 

cctgagatga 

aacctgagac 

ggaactggtt 

agactttttg 

aaaa 



ggggcttggt 
ggggatcccg 
cgcgggcggg 
agctacaaga 
tcacctggga 
tgctggagac 
tcttcaagct 
tttcaggcta 
tggataagct 
tcttttctga 



tctatgtccc 
gggggccgga 
cagggcctga 
aaagaataaa 
ggagtggcag 
ctacagcagc 
agagcaagga 
agcctccctc 
accttaatga 
tttaagaaaa 



tgcgggtcgg 
ccagtgttcc 
agtgcagcta 
tctctggagt 
gacctggatg 
ctggtatcat 
gcagagccat 
cttttggtag 
aggacgacac 
ctttatctcg 



tgcgagggcg 
ccagttgtgg 
tgtttccagt 
tggtgtcctt 
acgctcagag 
tggggcattg 
ggatagtaga 
aggctttttt 
atccctccag 
taagctacct 



aagaggaacc 60 
gagcagacgc 120 
gttctctggc 180 
tgaggaggta 240 
gaccctgtac 300 
cattaccaaa 3 60 
agaaacccta 420 
ataccatttg 480 
agatgataaa 540 
atagtttaca 600 
604 



<210> 79 

<211> 641 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 
<223> incyte ID No: 



LI : 803 998 . 1 : 2000FEB01 



<400> 79 

aggcttcaac 

ctacaaatgt 

aagaattcat 

aattctcaac 

aagaatgtgg 

actggaggga 

cttactacat 

aaagcctcta 

ctctacaaac 

aaaagaaatc 

tggctgtcac 



cagtcctcaa 
gaagaatgtg 
actggaggaa 
gcttactgaa 
caagcctttt 
attcctacaa 
ataagacaac 
atagatcctc 
ctgaaagatg 
gtacagatga 
acttgattgt 



accttatgga 
ataaagactt 
aaaccctaca 
cataagggaa 
agctattcct 
ttgtgtggaa 
tcatactgga 
aattcttaag 
tgacaatgct 
gaaatcctag 
aggtaagata 



acataagaga 
caactggtcc 
aatatgaaaa 
tttgtacagg 
caatccttac, 
tgttgcaatg 
gagaaaccat 
agacataagc 
tttggcaaca 
aaaagtgaag 
actcatactg 



attcatactg 
tcacacctta 
atgtggcaaa 
agggaaaccc 
taaacattca 
cccttaacca 
gcatgtgtga 
taattcatac 
cctcagactt 
aatatgacaa 

g 



gtgagaagcc 60 
ctactcataa 120 
actattaatt 180 
tacaaatctg 240 
tgtaattcat 3 00 
gtccttaagg 3 60 
agaatgtggg 420 
acaagagaga 480 
ttctgaatat 540 
agcccttaag 600 
641 



<210> 80 

<211> 991 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: LI : 478757 . 1 :2000FEB01 



<400> 80 

gtgagaaggg 

tgggccgaga 

tgtgctgtcc 

ggcggcagag 

aggcatcatg 

gattcgttac 

gtacggatgc 

ccttctcggt 

gtcatttagg 

tcagaggacc 

gtattgcatc 

caggcaagga 

ttaataccag 

gtttctgtga 

caatgggaac 

ggagcaacct 

tgtacataag 



aaaggagggg 
cctcattaga 
agactcccac 
gagggcaagg 
cggagggccg 
tcgctcgttc 
ccccatcttg 
ttctctgctc 
gatgtgacta 
ctgtagagga 
attcacaggg 
gccatggata 
tagaaactat 
tgaaagcatg 
agcttctggc 
tttgaataca 
agagcttaca 



tgagggtgca 
gatctagagc 
aggctgggtg 
cccgaaggag 
agcttagcag 
tcgatgttga 
atcttacaga 
tgccaacact 
tgggcttcac 
atgtgatgct 
tattgcattc 
ttagaggaaa 
tcaagaatga 
aaataattca 
tgaatgaaga 
gtgaatgtgg 
caggacaaaa 



ggggccctcc 
accggatggg 
atggtgggag 
cagacagcac 
ccaagtggag 
ctgctggcat 
atccagaggt 
agtggatatg 
tcaagaggag 
ggagaactac 
ctaaaccaga 
aatttcgaag 
agttcaatga 
ttttgaagag 
cctcatttgg 
gaaagctttc 



taggaggaca 
aggagcagag 
atttgtgccc 
cgcttcttgg 
gacagcaccc 
attgcagcac 
gcagccgcaa 
atcacatccc 
tggcatcatc 
agccaccttg 
agtgatcctc 
ccagagtcat 
gtttaacaaa 
gaaccttctg 
catcagaaga 
cctgagaatt 



tcggacctgt 60 
gctcgatttg 120 
attcagatgt 180 
ggacttgtga 240 
tccatgcctg 300 
aactagagat 360 
gaaagctaca 420 
agggatcagt 480 
tggaccctgc 540 
tctcagtagg 600 
aaattggaga ttQ 
ctggaattaa 720 
ggtggaaaat 780 
aatataataa 840 
ttaaaaattg 900 
cactcttcct 930 
991 



37/228 



BNSDOCID: <WO 0162927A2 I > 



WO 01/62927 



PCTYUS01/06059 



<210> 81 

<211> 680 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc__f eature . 

<223> Incyte ID No: LI : 808532 . 1 : 2000FEB01 



<400> 81 

gcacaatttc 

tgaaatatgt 

cacagcagag 

tttgcataag 

gaagagcttc 

gccatacaaa 

ccatagaacc 

acatgcttct 

tgaagactgt 

cagtggagaa 

tctggatata 

gtgaaatttg 



cagcttcaga 
ggtaagagct 
aaactgtaca 
catcagatga 
aaatggtcct 
tgtgaggagt 
cacacgggag 
agtattttga 
ggaaagaggc 
aacccatcca 
attttatcat 
atacatgtat 



aacatcacag 
tctgtcttag 
aatctgaaaa 
ttcatatggg 
catatctttt 
gtgggaaggg 
agagatctta 
atcataagaa 
ttgtatgccg 
aatgtgagga 
tatttttaaa 



aattcatact 
gtcaagtctt 
gtatggaaga 
acagaaacca 
ggtccatcaa 
ctacattagt 
taactgtgat 
actccactgc 
gtcatactgt 
ctgtgggaag 
tgacatataa 



ggggagaagc 
aataggcatt 
ggtttcattg 
tataattgta 
cgagtccaca 
aagtcaggtc 
aactgcggga 
caaagaaagc 
aaagaccaac 
cgctacaaga 
gttatacata 



cattcaaatg 60 
gcatggtcca 120 
ataggctaga 180 
aagaatgtgg 240 
ctggagaaaa 300 
ttgacttcca 360 
agagctttag 420 
cattgaaatg 480 
aaagagacca 540 
ggcgcttgaa 600 
tttatggagt 660 
680 



<210> 82 

<211> 420 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mi sc_f eature 
<223> Incyte ID No: 



LI: 443073. 1:2000FEB01 



<400> 82 

aaaaccctat 

acatcagaga 

tagctatagt 

atgtaaagat 

tcatactatg 

gccaagacct 

ccatcccatc 



atgtgtaagg 
attcatactg 
ggatctctta 
tgcaggaaat 
gagtcattct 
gtaggtttca 
atccttgtcc 



aatgtaggaa 
gagagaaacc 
ctctacatca 
ctttcaggca 
tgactctttc 
tctcctgaat 
aatgcacatt 



aacatttagc 
ttatgaatgt 
gagaattcat 
gcgtgcacat 
ctctccctca 
atttctggaa 
aatatatttg 



cagaatgcag 
aatgtttgtg 
actggagaaa 
cttgctcatc 
ccctccacat 
tccacctctt 
acatgggata 



gccttgctca 60 
ggaaagcatt 120 
gaccctatga 180 
atgagagaat 240 
caaatcagtt 300 
gaatccattt 360 
ctcgagtagc 420 



<210> 83 

<211> 659 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> Incyte ID No: 



LI : 47 9 67 1 . 1 : 2 0 0 0FEB0 1 



<400> 83 

cgtgcccgca 

tcaggaggct 

gggaggatgt 

gctgtgaact 

agagacgtga 

cagagcattg 

tttccaagac 

aggattaact 

ttctctaaac 

gagattggga 

ctgacgcatc 



cgtgcgtgtc 
ctgccgcagc 
cgggacaccc 
tcacccagga 
tgcaggaaac 
aagatcagta 
aaggaagttg 
tcgtgactcc 
ctttgaagtc 
tccatctagt 
ttggaagaac 



tcggtcagta 
cggggccctc 
cggaagctgg 
ggagtgggct 
cttcaggaat 
caaaaattct 
tgcaaagcct 
aaagcccatg 
ctggaaagta 
actcttcacc 
ttggaataac 



gccctgcgct 
ctgtgacctg 
gaaatgaact 
ttgttggatc 
ctggcttcca 
tcaagaaatc 
cacagctact 
cacctctgca 
actcttctgc 
atttggaaca 
ctgaggaaat 



tctccttcac 
catgtactgg 
cggttgcctt 
cttcccagaa 
taggaaacaa 
taagatgagg 
aagaggctgg 
agccacacga 
aatagaagca 
actgcctcca 
ggtcgatttg 



tctcggcggt 60 
gggattcgca 120 
tgaggatgtg 180 
gaatctctat 240 
aggggaagac 300 
tttatataac 360 
gctgggactc 420 
atcaagcagc 480 
agaatcctgt 540 
gagaatgctt 600 
ctaatttag 659 



<210> 84 

<211> 772 

<212> DNA 

<213> Homo sapiens 



38/228 



BNSDOCID: <WO 0162927A2_!_> 



WO 01/62927 



PCT/US01/06059 



<220> 

<221> misc_f eature 
<223> Incyte ID No: 



LI : 810078 . 1 : 2000FEB01 



<400> 84 

ccctatgtat 

gtacgatctc 

acatcctcac 

gttgaatgtg 

actgaagaga 

cttaataatc 

aaggctttta 

ccctatgagt 

gaaaggactc 

tgaagagagt 

tcagcaatgc 

ctagtgagaa 

taaagacatg 



gtaaggaatg 
acagtggaga 
gccttattca 
ggaaagcctt 
aggcctgtga 
acatgcgaac 
actattccac 
gtaaacaatg 
acactggaga 
tcctttagaa 
gggaaagctt 
actgtccatg 
aatgaactca 



tggigaaagcc 
caagccctat 
acatataaga 
tgcagtttcc 
gtgtaagata 
gcacagtgcc 
ccaccttaaa 
tggaaaggcc 
gaaaccctat 
ttcatgaaaa 
acagtcatcc 
taataaatgt 
ctctggagag 



ttcactcagt 
gaatgtaagg 
actcacactg 
tcaaatctta 
tgtgggaaag 
cagaaaccat 
attcacatgc 
ttcagtcatt 
gaatgcaagg 
aactcacaca 
ccgttcactt 
gggaaagctc 
aagaagc tgc 



actcgggcct 
aatgtgggaa 
gagagaagcc 
gtggacattt 
tatttgggta 
acacctgtaa 
gaatccacac 
ccagttcatt 
agtgtgggaa 
gaagagaaac 
cgaagacatg 
tcatttgttc 
atgaaaatta 



tagtatgcat 60 
atccttcctt 120 
ttttgtatgt 180 
gagaactcac 240 
tccctcatgt 300 
ggaatgtggg 360 
tggagaaaaa 420 
tcaaatacat 480 
agccttcacg 540 
cctataaatg 600 
aacaaattca 660 
cagttcactt 720 
ct 772 



<210> 85 

<211> 441 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: LI : 810224 . 1 : 2000FEB01 



<400> 85 

ggggagaaat 

gcatcagcaa 

tcagaaaatc 

gtaatgaatg 

acagatggga 

ttcacagaac 

aaaaccttcc 

cacgcactat 



catgaatgtc 
acacccaggg 
cgacctcatt 
tgggaagtca 
aatcttatgc 
atctgagaag 
atcagaagtt 
agatgtactg 



atgaatgtgg 
gagaagccct 
aaacatcaga 
ttctccgaga 
atgtattcaa 
ccacacaggg 
gccctaattg 
a 



gaaaccctcg 
atgaatgcta 
gaattcacac 
agtcaaccct 
tgtggaaaat 
gagaaacctt 
ttcaccaaga 



tttaagtcag 
tgcatgtggc 
aggagaaaaa 
tactaaacat 
ttttctgctg 
ttggatgtaa 
gaactcaata 



ttctaactgt 60 
aacacctttc 120 
ccttatgaat 180 
ctaagaacac 240 
ctactacagt 300 
tgaatgtggg 3 60 
taatgacaga 420 
441 



<210> 86 

<211> 1072 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: LI : 817052 . 2 : 2000FEB01 



<400> 86 
aattccccgg 

gggtgacccc 

ggagtccacg 
gaactgcacg 
ggaggaggat 
ggccagtatg 
cccgcccctg 
gcggcacctg 
gagtgcggca 
aaggaggggc 
acgctgcacc 
acgttcaacc 
ggcggtggcc 
agcctgacca 
aagaagagct 
catgaggtct 
ctgcgccacc 
agcttcagcg 



gtcgacccac 
tgtggggaac 
gaaggctcca 
gaggggatcg 
gagctgccgc 
taccagaccc 
cagctaggaa 
agagccagca 
agagcttccg 
cctacgagtg 
agcgcatcca 
ccaagcacgc 
ctaagcccta 
aacaccagat 
tccgcctgca 
ccttcatctg 
agcggactca 
agaagtccaa 



gcgtccgggt 
agccagacct 
gcgagttcag 
tgatcaagac 
agcacttgca 
cgctgcccgg 
accccgcagt 
gcagcggaac 
cctgaagatc 
cgccgaatgc 
ccgcgtgcgc 
gctcaagccg 
caagtgcccc 
cacgcacacg 
catcagcttg 
cagcctgtgc 
cacaggcgag 
gctcaccaac 



ggaagagcac 
ggacatgcag 
cgaactgaag 
agaggaacaa 
atcccttggg 
ggagatgtcc 
gaaaaggctg 
cggcgcggcg 
aacctcatca 
gagatcagct 
ggaggctgcg 
cgtcccaagt 
gagtgcgaca 
ggtgagcggc 
gtgatccatc 
ggcaagagct 
cggcccttca 
cactgccgcg 



ggcttccagg 
gagccagaga 
cagatgctgg 
gacgaccact 
cagctgtccg 
cccgagggcg 
gcgccctccg 
agcggccctt 
• tccaccagcg 
tccggcacaa 
tctcacccga 
cacccagctc 
gcagcttcag 
cctacacgtg 
agcgcgtgca 
tcagccgccc 
agtgccccga 
tgcactcgcg 



actcagagct 
acacgctgga 
tgcagcagag 
tacaccacga 
ggagatatga 
aggagagccc 
tgcacggtga 
cacatgcatg 
caaccacatc 
gcaacagctc 
acgcgggccc 
tggtagcggc 
ccacaagtcc 
occcgagtgc 
cgcgggcaag 
ctcgcacctg 
gtgcgagaag 
eg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1072 



39/228 



BNSDOCIO: <WO 0162927A2_1_> 



WO 01/62927 



PCT/US01/06059 



<210> 87 

<211> 759 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LG : 892274 . 1 : 2000MAY19 

i 

<400> 87 

ctgtcgggaa atgagatggg ggcctggact tgggtcccac aaacagtgag ggcattccca 60 
gtccagacct taaccctgtt ttaggcatgg gttcctggag gcacattgat tccatcaccc 120 
caggaacacc aggaagtgca ggattagatc tacctgccag agaaagaatc acattagttg 180 
ggggagacaa acccatcaaa gttcccactg gtatttgggg aacttcacca gcaggatiaca 240 
tgggactaat tttaggcaaa agccgcctta acttgcaagg catgactgta gtcccaggag 300 
ctgttgactc tgattatgaa ggagaaactc aagtagtttt aatgtcacaa gatcttitggg 360 
tttttgaact gggagaatat attgctcaat tattgcttat tccctgcaaa ttacaccctt 420 
ctccatgaaa ggagaaacga ggaaataaag ggtttgggag cacaactaca tgggaaatct '480 
atctatcccg acccatagcc tctaatagac ccacctgtgt agtacaaatt aaaggaaaga 540 
aattttatgg gcttatggat acgggaactg atgtatcagt aatatctaaa gacaattggc 600 
ccacatcctg gctcttgcaa ttaacttcta catccctagt ggaagtagga acagctcaaa 660 
gtgttcaaca gagtgctgag attttacctt gtcttggtcc ggatgggcag tcatgtactt 720 
ttcagcctta tgttgcaaat atagctatca gtttatggg 759 

<210> 88 , 
<211> 570 

<212> DNA ' , 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LG: 1080959 . 1 :2000MAY19 
<400> 88 

aaaaaaacaa gttgctttat ggaagaaagt aagtatagac agagagaaag ggatctgatg 60 
accaaagcag ggaataaatg tttggagtcc acggcatcct gagaacttct tgggaataga 120 
gtctaggccc ccaatgctgt cactctcacc catcctcctc tacacatgtg agatgtttca 180 
ggacccagtg gctt,ttaagg atgtggctgt gaacttcacc caggaggagt gggctttgct 240 
ggatatttcc cagaagaatc tctacaggga agtgatgctg gaaactttct ggaacctgac 3 00 
ctctatagga aaaaagtgga aagaccagaa cattgaatat gagtaccaaa accccaggag 360 
aaacttcagg agtgtcacag aagagaaagt caatgaaatt aaagaagaca gtcattgtgg 420 
agaaactttt accccagttc cagatgacag gctgaacttc cagaagaaga aagcttctcc 480 
tgaagtaaaa tcatgtgaca gctttgtgtg tgaagttggc ctaggtaact catcttctaa 540 
tatgaacatc agaggtgaca ctggacacaa 570 

<210> 89 

<211> 408 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LG: 1054900 . 1 :2000MAY19 
<400> 89 

cggacgcgtg ggctaggccc ccagtgctgt cactctcacc catcctcctc tacacatgtg 60 
agatgtttca ggacccagtg gcctttgatg atgttgctgt gaacttcacc caggaggagt 120 
gggctttgct ggatatttcc cagaggaaac tctacaagga agtgatgctg gaaactttca 180 
ggaacctgac ctctgtagga aaaagttgga aagaccagaa cattgaatat gagtaccaaa 240 
accccaggag aaacttcagg agtctcatag aaaagaaagt caatgaaatt aaagatgaca 300 
gtcattgtgg agaaactttt acccaggttc cagatgacag gctgaacttc caggagaaga 360 
aagcttctcc tgaaataaaa tcatgtgaca gctttgtgtg tggaaaac 408 

<210> 90 

<211> 532 

<212> DNA 

<213> Homo sapiens 



40/228 
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<220> 

<221> misc_feature 

<223> Incyte ID No:' LG : 1077357 . 1 : 2000MAY19 



<400> 90 

cattcatgga. 

gcaaaggagt 

gaattctctc 

gtgatgttgg 

attatttcca 

gtaaaaaata 

ttgggaagca 

catctgtctg 

gaaaagttta 



ttgacttata 
cagggatggc 
aggaggagtg 
agaactacag 
tgttgaagca 
cagatggaag 
atgcagaaaa 
aattgcagct 
caaaccatcg 



aacagtcatg 
tcttcctcag 
gaaatccctg 
gaacctggtc 
aaggagagag 
ggaatgtgtc 
caagcctatt 
gtttcaagcc 
ttcctcagtt 



ctatgtgatg 
ggacgcttga 
gaccctggac 
tttctgggaa 
cccttgattc 
agaagcgtga 
aaaaatcaac 
ggaaggaaaa 
tcaccacttc 



aagaagccca 
cattcatgga 
agagggcttt 
tctgtcttcc 
tgcaaagtca 
acapagggag 
ttggattaac 
tttacagaag 
aaaaaatttc 



gaagaggaaa 60 
cgtggccatc 120 
atacagggac 180 
tgacctaagt 240 
agttaaaata 300 
gagctgtgta 360 
ccttgagtca 420 
taatccagtt 480 
tt 1 532 



<210> 91 

<211> 546 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> Incyte ID No: 



LG: 1084051. 1:2000MAY19 



<400> 91 

caacaagtta 

aacctattca 

aggaatgtgg 

ctggtgagaa 

ttaatcagca 

aagctttcag 

cttttacttg 

taagaattca 

atagagcgta 

aatgtg 



cataagaaca 
ccttactgaa 
aagggccttt 
accctttgaa 
tcatagaact 
ccagaacatt 
caatgaatgt 
taccggggag 
tctaacacat 



aaaacctatg 
catatgagaa 
agtcaaagtg 
tgtgaggaat 
cacactgggg 
agcttggttc 
gggaaaacct 
aagccctatg 
caccagagat 



aatgtaatat 
ttcatactgg 
catccctcag 
gtgggaaagc 
agaaacccta 
aacatttgag 
ttagacagat 
catgcactgc 
ccatactggg 



atgtgaaaaa 
tgagaaacct 
tacacaccag 
cttcagacat 
tgtatgtgat 
gactcattct 
tagacacctt 
atgttgtaaa 
gagagaccta 



atcttcaaac 60 
ttcagatgta 120 
agaatccata 180 
cgctcabcac 240 
aaatgtcaga 300 
ggagagaaac 3 60 
agtgaacata 420 
acctttagtc 480 
caatgtaaag 540 
546 



<210> 92 < 
<211> 1386 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 
<223> Incyte ID No: 



LG:1076853 .1:2000MAY19 



<400> 92 

gcctttagtc 

tttgaatgta 

aggattcata 

agcacatctc 

gactgtgggg 

gcagagaacc 

gagaaaccct 

gagcatcaca 

tttagtcgac 

gaatgtattg 

actcatactg 

accaacctgc 

tgtgggaaaa 

aataaactct 
gattcaggat 
tacgttgtgt 
cacattctca 
gggatgatat 
agtttcaaca 
tgcagtaatg 
cttacattaa 



gatgttcttc 
gcatatgtgg 
agagaggcaa 
tcactcagga 
aagactttag 
cctatgattg 
atcaatgtaa 
gaattcatac 
gctcatcgct 
actgtgggaa 
gagagaagcc 
atgatcatca 
acttcagccg 
aggaaccgtg 
cagaggattc 
gtggagaaaa 
gatctgatta 
gcctgtatgt 
tcttgacttg 
accaaaaccc 
cttcaccatg 



ccttgtccaa 
gagggctttt 
accttaccaa 
tgaaagcact 
tcacattaca 
tgagcaggct 
tgtatgtggg 
tggagagaaa 
tactcaacat 
agcctttagt 
ctatgaatgt 
gagaattcat 
aagttcagct 
aaattaagga 
ttagagagct 
ctgccagtag 
cagactagtg 
tggacttggc 
tgacccccaa 
a-ttttaaaaa 
gaaaccagtt 



catgagagga 
ggtcagagcc 
agcagtaact 
cttaccgaag 
gactttactg 
tttagtcagc 
aaagctttca 
ccctatgaat 
gagagaaccc 
caaagttcat 
aatgaatgtg 
actggagaaa 
cttactaaac 
atttgcagaa 
tgggaatgta 
acagattttt 
taaaaacagc 
tttggaatat 
tgtcaacagc 
ttgcttgaca 
ccaactccag 



ctcatactgg 
catcccttta 
acagcataga 
tgaaatccta 
accatcagag 
aagctatttc 
aaaggagtac 
gtaatgagtg 
acactggaga 
ctctcattca 
ggagagcctt 
aaccctattc 
accagagaat 
tgctttagct 
atgaattacg 
ttttttttaa 
tacatgtatg 
atgtatgcag 
ttttttaaaa 
actgcactca 
gaagfccacca 



agagaaacct 
taaacatatg 
tttcaagcac 
ccattgtaat 
gatccatact 
tca1:cctgga 
aagttt<:ata 
tggagaagcc 
gaaaccctat 
gcatgagaga 
ccgaaaaaaa 
ttgtaaggaa 
toatactcga 
aaaatgttct 
tgtgtgttta 
cataaagaca 
tagctggttg 
gatatcatca 
agcaaattcc 
actggcagct 
ttcaaagaat 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

730 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 
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tagatcaatt agcc^BKcca cttcattggt ccagtatgaa gac^Pfaaag ccaaagatgt 1320 
gacgtggttc ccacagtatg atacagccta taagggtaaa gctgggttaa aaatgcggtt 1380 
cctgga 1386 

<210> 93 

<211> 1039 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LG: 481631 . 10 :2000MAY19 



<400> 93 

acaggattga cttctaaaga ctcttggtac ctgaggaaga aacccagaag aggaagagga 60 

aagcaaagga gtcggggatg gctctttctc agggtctatt gacattcagg gacgtggcca 120 

tagaattctc tcaggaggag tggaaatgcc tggaccctgc tcagaggact ctatacagag 180 

acgtgatgct ggagaattat aggaacctgg tctccctgga tatctcttcc aaatgcacga 240 

tgaaggagtt cttgtcaaca gcgcaaggca acagagaagt gttccatgca gggacattgc 300 

aaatacatga aagtcatcac aatggagatt tttgctacca ggatgttgat aaagatattc 360 

atgactatga atttcaatgg caagaagatg aaagaaatgg ccatgaagca cccatgacaa 420 

aaatcaaaaa gttgacaggt attacagaac gatatgatca aagtcatgct agaaacaagc 480 

ctattaaaga tcagcttgga tcaagctttc attcgcatct gcctgaaatg cacatatttc 540 

agaccgaaga gaaaattgat aatcaagttg tgaagtctat ccacgatgct tccttggttt 600 

i caacagccca aagaatttct tgtaggccca aaacccatat atctaataac catgggaata 660 

atttctggaa ttcttcatta ctcacacaaa aacaggaagt acacatgaga gaaaaatctt 720 

tccaatgtaa tgagagtggc aaagccttta attacagctc actcttaagg aaacatcaga 780 

taatccattt agcagacaaa tataaatgtg atgtatgtgg caagctcttt aatcagaagc 840 

gaaacctagc atgccatcgt agatgtcaca ctggtgagaa tccttacaag tgtaatgagt 900 

gtggcaagac cttcagtcag acgtcatccc ttacatgcca tcgtagactt catactggag 960 

agaaacctta caaatgtgaa gaatgtgaca aagctttcca tttcaaatca atacttgaaa 1020 

gacataggat aattcatac 1039 

<210> 94 

<211> 521 

<212> DNA. 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 1088431 .2 :2000MAY19 
<400> 94 

cggatctcgc ggggctactg gcgctctcgg cccacacaat atgacctcgg ggaggatgcg 60 
aggaagatga actgtgatga tccacttctt cttaatgaat gactgactta cctgagaaag 120 
aaactcagag gaagaggaaa gaaagaagag gagggaatgg ctctttctca gggactgttt 180 
acattcaagg atgtggccat agaattctct caagaggagt gggagtgcct ggaccctgcc 240 
cagagggcct tgtacaggga cgtgatgttg gagaactaca ggaacctgct ttctctcgat 300 
gaggataaca tccctccaga agatggttct caccttgcag cctgtggaca gagcacactg 360 
cctcttcctt agaatcctac aaaatccgac ccttttattt tactcccgta ccttttattt 420 
ctctcccttt tctagctttc tactctatcc gggaacttag ggagacctta gtcttcctga 480 
ttccatattc ttctaagatt tgcctctgaa agttgctttt g 521 

<210> 95 

<211> 1243 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 401619 . 10 :2000MAY01 
<400> 95 

gctcttcatc ctttaccaaa gagacaagca cttgaaaaaa gcaatggtac cagcgcggtc 60 
tttaacccca gcgtcttgca ctaccagcag gctctcacca gcgcacagtt gcagcaacac 120 
gccgcgttca ttccaacagg tatgtgccct tactgcccta cgtcctgtgc ccttctggtc 180 
atgtgctttc ttctcatttc tctaagctgt ttggtggcat ctagtttgct tttgaaggta 240 



42/228 
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taatacagtt tgaaattcat ^^ttgtccta 
tgatagctaa agactgttag gattctaaag 
ttaacccatg agaaacagca gttagggctt 
aaattgttta tgtaaataag 
aatgacctaa aaatagcaat 
gcactaaaac aatgaggcta 
ttctatctct ttctccatcc 
atcctcagtg tctaagccaa 
ccactaaaag atccattgta 
ctcctcatcc agctgattgt 
actttccaag gtgccacaac 
tttacatcac attctaccac 
gaaaaaagaa aaaagcagat 



accttttatg 
tccagtaaat 
ttcaaattaa 
ttctcaactt 
tgcctgatac 
atagtcatag 
tttacagatg 
taatgagaaa 
ctgggcccgc 
aaaagtgaat 
ataaatcatg 
cacaccatac 
tagaagcaag 
tcaacatttt 



gctatctaaa 
ccaaatattt 
ttggtttctt 
tgtgacaatt 
actaaccatt 
agcaattctc 
tcaccaagtt 
tattacgtac 
aatcttagag 
agaaaac tga 
aagaactgag 
ctatatatac 
ttttaaataa 
tcatccctcc 
gcaaagatag 
gggtaactta 
cagcgccggt 



PCT/US01/06059 



aaaagtaaaa aaacaaaagt 
taaccacaga cacacacacg 
tttcaccttt tagtcaatgt 
tccattgagt tctttgagag 

<210> 96 

<211> 4338 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc__feature 

<223> Incyte ID No: LI : 1144007 . 1 : 2000MAY01 
<220> 

<221> unsure 

<222> 2673, 3345, 4031 

<223> a, t, c, g, or other 



tgtatttacc^Kactttgaa 

gatagattga agagacagat 
gtatttgcac aagccctgta 
gaaatttgtc cttaactctg 
tttttctatt tctattcaga, 
tactcatatt tttatattca 
cacaagtata tagagctctt 
gatgtgcatt aactatgatt 
tttaaaggac tcttagtgat 
ggccccctaa atgagaagtg 
tttccctgtg accaaaccca 
acattccaca gagttctcct 
ctgacc'ccaa aaagtcagat 
cccatttgca ccgacatctc 
tcaccataat tgaccatgtt 
agtgctgagt tgggagagcc 
agt 



300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1243 



<400> 96 

cgtgttccga 

agtgggtggg 

tggaagaaaa 

agatggtgtg 

ggatctttgg 

gccagagtct 

acgcttccgc 

ttgtcaccag 

gctggaacaa 

gccagaaagt 

accaaggcgg 

gcatttagga 

ccccgagcag 

gcgtccacac 

ggacccagac 

tgctctgtca 

gtggagtgat 

actgtggaat 

agggttagac 

gccagaaatc 

ggagcaggaa 

tcaccagact 

aagatttggt 

acacccagtc 

cacttgtaac 

aaatgtatgg 

aaaggttcac 

gagacctccc 

gatgattgcg 

tactggagaa 

gttaacagac 

ggtgaggact 

ggaactctac 

gcacttgaga 



cccgctaggc 
agagaagccg 
tcctcgctgt 
agaccacccc 
gaagaagagg 
gtcttacaga 
taccaggagg 
tggctgagac 
tttcttaccg 
ggcgaggagg 
tgggtgactg 
gcggagcctg 
tctcctgagg 
caggaagagg 
cttcctgcag 
cagggactgg 
ctggacccaa 
tgttgtctct 
gaggaagagc 
ctgagtttta 
gatctgagtt 
ccagattggg 
tactgaatat 
caccccctgt 
tcccaccttg 
gaatgtggga 
aaggatgaac 
ctgtgacaca 
ggaaagcact 
aaacccttct 
acaccacaag 
tcagtgaaca 
ctctgcagcg 
cgcaacgcct 



cccgcgcggc 
gggcagggga 
gtccaggctg 
aaagccaaga 
gaattctgat 
gggatgaccc 
cagcaagccc 
cagagaggcg 
tcctacctgg 
cagtgacgct 
tccatgttca 
agtcacctaa 
aaaccacaca 
agctccagac 
agaggagctc 
taacgttcaa 
cacagaaaga 
ctgtccattc 
ctggggtccc 
cctacacagg 
tggaggatat 
aaatagtctt 
ttcttcaagt 
tagggaggca 
ttagacacct 
aaaagttaca 
gcgccttaca 
ggctgagaga 
tccgctggac 
tttgtactat 
aatccacc-tg 
caggcggtac 
agtgcgggcg 
cagtgaggcc 



tcggaatccg 
ggagccgccg 
aggcgggggg 
aatggctaca 
ggtgaaactg 
ggtgctggaa 
tagagaagct 
gacaaaggag 
agaactacag 
ggtggagggt 
cggccaggaa 
tgagctgcag 
gagcccagat 
cctgcaggag 
tggagactca 
ggatgtggcc 
gttctatgga 
ccaatccocc 
agatatccaa 
agataggagt 
acacaggcct 
tgaggacaat 
gaatagtttt 
tcatgactgt 
gaggactcac 
cacggaggct 
aatatcccct 
actccatcag 
ttcagacctt 
ttgtggcaaa 
ggaggcaaac 
ctggcgcacc 
ctgcttcacc 
ctgccgatgc 



gcggcgctgt 
gagctgtcgg 
ctaatgacag 
gccgtggaac 
gaagatgatt 
acctcccacc 
ctcatcagac 
cagatcctag 
agctgggtgc 
ttgcagaaac 
gtcctgtcag 
gatcctgtgc 
ctgggggcac 
agcgaggtcc 
gagatggttg 
gtatgctttt 
gaatatgtct 
agacctgatg 
gagcctcagg 
aaagatgagg 
gttttgggag 
ccaggtagac 
gtggaacctt 
tctgtgtgtg 
acaggaggag 
cacatcttgc 
aaaccggaag 
tggagaaacc 
gtcagacatc 
agcttcagcc 
cctacttgtg 
ggaagacgca 
cacagcgcac 
aacgaatgtg 



ttcggtcggg 
agccggccct 
tgtgagctct 
cagaggacca 
tcacctgtcg 
agaacttccg 
tccgagaact 
agctgcttgt 
ggggccaacg 
aacccaggag 
aggagacggt 
aaagctcgac 
cggcagagca 
cagtgcccga 
ctcttcttac 
cccaggacca 
tggaagaaag 
agattctccc 
agactcaaga 
aagagtgtct 
aaccagaaat 
ttaatgaaag 
cggggaaact 
gaaagagctt 
gaaaccctat 
cagggcacca 
aattttggaa 
ctatagatgt 
agaggacaca 
agaaatctgt 
tggagagfcgt 
cgctgctgta 
ggat-cgccaa 
gggaagagct 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 
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tcagtcgcag 
ttttcccaac 
atcagaggac 
ttcagccctc 
ccttttcgag 
acacgagcct 
aaagtacaac 
gggtataaac ■ 
aaataggctg 
aggacagccc 

ggggtcaaat 

tgggctttgc 
cttcttacca 
aggctgcttc 
aggtgccaaa 
ttggttttgt 
tattcaacca 
gtttgagtgc 
atatgcacaa 
gatttggaaa 
gggccacttt 
aagaaaaagc 
atcccgtgac 
ttctcccaca 
ttttatgttc 
gttaaaggtt 
ggagtggcaa 
agcatggaac 
ttatgggccc 
aaattctcat 
cgttgtcctt 
gttccctggt 
tgatacttct 
ttttccttgg 
acattatttg 
taagcaggca 
tctgccacaa 
tactggcagt 
aaaatctgtc 



gga etc 
gtgccctacc 
cccactcaga 
acctaaggga 
ggagtctccc 
gggcagaacc 
ctgaattgag 
gtaggagaac 
ttacatatcc 
ctttaaggtt 
cttaaagcac 
cacacattca 
tgcactttcc 
atgtggacct 
cactgetaga 
ttttaatatt 
gtctcttggg 
ttattgagta 
gcccaccaac 
ctgtcaacct 
atttacatga 
tgttgctggg 
tgagtactgt 
ggattaattg 
atagagptag 
ccttggagat 
gaactgggga 
aggggagagt 
gaggaatagg 
tttggtcgat 
ctggaaatat 
ttctgggact 
ggagggtggt 
nagggaaagg 
cactctgaga 
tttctaaacc 
aagatagttt 
gacaacaaat 
agagttcg 



gtcaggcatc 
tgtggaaaaa 
aaagacctcc 
ggtgaggaag 
tgacctgctc 
tctcagcctt 
gcttctcctt 
taagagactt 
taaagactgc 
tgggaagggg 
acagttctgg 
cagggataac 
tttgatgctg 
tggtcaagct 
aagggggggc 
tttttcctat 
ttgcatccct 
aagcataatt 
cccttcacac 
tacaaaggca 
agacaactgg 
caagaccatc 
gaeatcaegg 
cgcttcaatg 
aaagtagcat 
gaggctgaat 
gaccgtcctc 
cttctgtgcc 
ggagggttaa 
gatattctga 
acatatatga 
tcccatctta 
tcatactcca 
gaagggttgg 
taagatccaa 
aagtatctaa 
ctcctagctc 
tgacttttta 



agagaacaca 
gcttcagcag 
tagctaggtc 
aggaaaagee 
agatctgaca 
cctctacgcc 
cacteggagt 
aagagegteg 
tcaacaagct 
aegaagectg 
ganttcaaga 
ttgeatagat 
aggagaaatg 
ggctccctcc 
ctaggtcaga 
taaaatactt 
agcacttctc 
eggtcataait 
cccccttcac 
ctataacaat 
gagaacgact 
aagactgttc 
aaagtgttga 
tatttgtttt 
ctcatgatgg 
aattatgaac 
tccatagtgg 
tggtttcttc 
taagactcca 
tggactggtt 
aataacataa 
cccattcctt 
aattagattc 
gggatggatt 
gcctggagtt 
gcctaagcac 
agacttaacc 
attttcttat 



PCT7US01/06059 



cad^^ggag 

aggatatcac 
cccatgtgag 
ctcttgtcag 
ttacctcttc 
ttgaggggca 
gcacctgcct 
tggttcctat 
tcaagttgaa 
aagggattct 
caggaeggtt 
tccctcgctg 
gaagtgggcg 
cccagcgtca 
agcctctttc 
catgeattta 
actacaagtg 
gaaatcgttc 
aggggtegta 
t^cagaatca 
aagaccaaat 
tgacncaccc 
acctgggaac 
gtttgctttt 
cccaacaatc 
ctcacccttc 
agcacagggt 
tcctctatct 
accctaatgg 
tggtcttaat 
eggtaacact 
ttccagggct 
tggggagtta 
taggcagagt 
tgcagaagat 
agcttgtcct 
atttataaag 
ttgeattatt 



aaaaccaatt 
ttaattaggc 
gagatctget 
cctgggaaga 
ctgcaactaa 
tgtttcatcc 
ctaccatcat 
ategtccaaa 
aagtggccaa 
gtctttactg 
tgcgtcctga 
tcttgattca 
aaaaatctca 
aattgttatc 
catacgagtt 
accttcccgt 
agatggtagt 
acattccctc 
tgagtaaggg 
tgattgecat 
tatggaaaat 
tgtccccatc 
ctg;tagagga 
tcttacgtga 
tctgttgcca 
tctgattgtg 
atggggttaa 
ecataatgea 
cccaacaggg 
accagtcaac 
tggcagccaa 
tccagtgtcc 
cagagtaatt 
tccggtggaa 
actgtcctaa 
gggtgaaatg 
gttggtaaaa 
ccaataaatg 



2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4338 



<210> 97 

<211> 1827 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LI : 331074 . 1 :2000MAY01 



<220> 

<221> unsure 
<222> 32, 57 
<223> a, t, c, 



g, or other 



<400> 97 

cctacaccat 

gcgaggatag 

cgtttgttca 

gggccctgcc 

gctcaccgaa 

caagatgccc 

gcgcaactca 

tgagctcaag 

ctcccccaat 

ccgcagcaag 

eggctgeatg 

cgaccggccc 



ggaggtgctg 
atgtgctcac 
eggactttet 
aagcactaca 
cacaaggtga 
ctgeagagcg 
ctcacgggct 
atccatggca 
ggecagggge 
caggacctgg 
gcccgcagag 
tgtgccggcc 



ctgeagaate 
gcaagaaggc 
teteggagaa 
tgtgtcccat 
cccacagcaa 
aggaggagtt 
tccgctgtgt 
ccttccacat 
tgeagaaget 
tgaagcttga 
ccaacggaca 
tccgttgccc 



aneggctgeg 
tgagtttatc 
egggctaegg 
ctgtggtgag 
gagectggae 
tattgagcac 
ggtctgcatg 
gcagaagctg 
ctacaagtgc 
cgtcaatggg 
ggtgggtggc 
cgagtgcagt 



ggaccacaat 
aagggcagtc 
gagcacctgc 
cgcttccctt 
acgggcacct 
tgccagatgc 
cagacagtca 
gegggcaget 
gccctgtgcc 
ctgccctacg 
ctggccccgc 
gtcaagtttg 



ateeggnegg 60 
acaagtgcaa 120 
agacgcaccg 180 
cgctgctgac 240 
gtcgcatctg 300 
accctgacct 3 60 
cttccacgct 420 
cagcggcgtc 480 
tcaaggagtt 540 
gcctctgcgc 600 
ccgagcccgc 660 
agagtgeega 720 



44/228 
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agacctggag agccacatgc T^&tggacca 
ccggaaaggc acccagacat cgccagtgcc 
ccagatgacc ttcgagaacg agagagagat 
ggaaggcatc aaccacgagt gtaagctgtg 
cctctgtcac ctcattgagc acagcttcga 
gtgtttcaca, gtcttcgtcc aggccaacaa 
gcaggaggac aagatctacg actgctcaca 
gctgcagaac cacacgatga gccagcacgc 
tccgcagaag gcttgccgga gacgccgtgg 
aagtgtcatt tgcaacccag atgtaaaact 
ataagcagct ggaaatgaat attaatggca 
tttattgtcc tgctacagct agtgtgcttt 
agttagagaa acctgctttt taaggctatt 
agtttttaaa aaagaattag tgtgattttt 
tttcttcccc cccaccccct acccccttcg 
aattgtgcat cgtgatgtga ttgcttggct 
attaaagact aatgctttga ttggatttgc 
aatgaatata atcggtttca gtgcaactgg 
actgcagtaa ctagtacagt tcaataa 

<210> 98 

<211> 825 

<212> DNA ( 

<213> Homo sapiens 



<220> 

<221> misc_f eature 

<223> Incyte ID No: LI : 1170349 . 1 : 2000MAY01 



ccgtgacctc 
ccggaaaaag 
ccaaatccac 
caaccagatg 
gggcatgggc 
gttgcagcag 
gtgccctcag 
acagtgaggg 
ggagggccat 
ctaatgattt 
gagattaaaa 
tagactttcc 
gtcctttgtt 
tttctttgct 
gttaactact 
attgtctgaa 
cagttcaccg 
atggtctgct 



a^PCa< 



PCT/US01/06059 



acgccggaga^cagtgggcc 780 
acataccagt gcatcaagtg 840 
gatgccaacc acatgattga 900 
ttcgactccc cggccaagct 960 
ggcaccttca aatgccccgt 1020 
cacatctttg ccgtgcacgg 1080 
aagttcttct tccagaccga 1140 
atcgctcaac aggacacctc 1200 
ttgaacatta catccaatca 1260 
ggccatgagg cgctgctatt 1320 
gtalbtccatg ctcagtattt 1380 
gccgcagact acatttctag 1440 
ccttcatgta ttatattgat 1500 
tctttttttt ctttcttgtt 1560 
ttttaattgc aattctaggt 1620 
tatttccttt taatttttta '1680 
gacagtgatt aaaactatgt 1740 
tttaaatgtg acttaatctg 1800 

1827 



<400> 98 

gccgctgctt 

ctttcttggg 

tttccaggac 

tcagggatga 

cctgaaccct 

ggtctggctg 

gccttggata 

gattaaggag 

gataaaatgt 

gcttactctc 

gaaatacttc 

accctataaa 

tgagagaatt 

ccagatgact 



caggtcggcg 
cagtattttt 
ctggtcacta 
gtgacattcc 
gctcagagag 
ggactttcca 
atggcaaaag 
ttatctgcaa 
acaagaaacg 
cataagagaa 
actcgtcaat 
tgtaacgaat 
cacactggtg 
catctcacac 



ggctccaggg 
cccagcgcca 
ttcaggacac 
aagatgtggc 
atttgtacac 
tttctaaatc 
aagagataag 
agagggctat 
tatgtaagga 
atcatacaca 
cacctctcat 
gtattaaaac 
agaaacctta 
agcatcagac 



agacctgggt 
cgcggaggct 
ggttccagcg 
cattgacttc 
aactgtgatg 
tgtgatttca 
aggcccattg 
taatgaagta 
atgtggaaat 
aaagaaatgc 
tcagcatcaa 
cttcaaccag 
caaatgtaag 
tacacatatg 



gggcagcgtc 
gggccattat 
cagtgcgcgt 
tccaaggaag 
ctggagaatt 
ctgttggaga 
ccagatgtgc 
ttatcgcagt 
ctatactgcc 
aatcagtgtt 
agaatccaca 
agggcacacc 
gaatgcagga 
agaga 



gccgtttctc 60 
gagctctgca 120 
tagccatgtc 180 
agtggggatt 240 
atcagaacct 3 00 
aaaggaaac t 3 60 
caggtgcaga 420 
ttgacacagt 480 
acaatatgca 540 
tagattgtgg 600 
cgggagagag 660 
ttacctagca 720 
aaaccttcag 780 
825 



<210> 99 

<211> 1100 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 
<223> Incyte ID No: 



LG:33 5097.1:2000FEB18 



<400> 99 

tcatctgtgc 

tctgtaccat 

gtctagaaag 

aagattatta 

tttaattttt 

ttaaaatgtg 

aggccaaggt 

gaaaccccgt 

gcccagctac 

agtgagccta 

aaaaaagatg 

tcgatatatg 



tgtgaattat 
ttcctgttac 
taacagttta 
aagttcatta 
tattctcaac 
gaattctcgg 
gggtggatca 
ctctactaaa 
tcaggaggct 
gattgctgcc 
tggaattctt 
tatgctatta 



gtgctgtatt 
tttactgaaa 
cttgtaaacc 
ttactaccct 
ctcatcttac 
ccaggtgcag 
tctgaggtca 
aatacaaaaa 
gaggcaggag 
actgcacfccc 
ttctgcaaat 
ttttacaagt 



gactctcaag 
cacacctact 
acatttctta 
atgatgtgaa 
tgaagagaat 
tgctcacgcc 
ggagttcaag 
ttatctgggt 
aattgcttga 
agcctgggtg 
gttctctaat 
aatacatgtt 



catgatgacc 
ccattaactt 
tccccaataa 
agtgtcattt 
aaaactcttt 
tgtaattcca 
accagcctgg 
gtggtggcgc 
acccaagagg 
acagcagaac 
agtatacctt 
gattgtattg 



attggtggtt 60 
cttgggttaa 120 
gtattttttt 180 
gcttaatctt 240 
taccatattc 300 
tcactttggg 360 
ccaacatggt 420 
gtgcctgtag 480 
tggaggttgc 540 
tctgtctcaa 600 
cttcagtctg 660 
gaaatta-tag 720 



45/228 



BNSOOC1D: <WO 0162927A2 I > 



WO 01/62927 





PCTVUS01/06059 



aaaagattat att^^ftgt ttagaaaata tttttaaatg tgal^feaaa tataaaaatt 780 
actcccttgt tccactttcc ccactctcaa gtcagactat gttgttttca tagttagtag 840 
ctagcagtct accccactag attatatgct tcacagaggg aagggaccct caagacttca 900 
ctggattgag tagcacccaa taccttgctt gctgcctggt ttgtgatggg catactgtaa 960 
gaaaaaaaaa tctgaatgac aaaatgtttt tccataatac cagacttcct cttgaagaga 1020 
tgggtcgtaa tgttgtagtc ttacatgctt acgtagacga tcaaagcaag aatactcaat 1080 
aaatggctat ttaccacttg 1100 

<210> 100 
<211> 584 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 1076451 . 1 :2000FEB18 
<400> 100 

gtgtacttac tttctgccat tatatcctct ttcttaataa agggaaccac atcatattcc 60 

taatcttatt actattaatg acagcacagg catcagattt caaggtgact tgtatgggca 120 

ccccattttt cttctgtttt ggctaacact ttactcgtat tgtttatgag cccccatcag 180 

tcctcagttc ttaatcttat ttcaaaaact .gtggtcatgg gaggctcaga tgggtcataa 240 

cacacattag gttggtcatt tcctgggcta tgtaccttgt atagaataac attatacaag 300 

caagttcttt ttagagttct agtacactta taataaccat aaaataatag gaccatagca 3 60 

1 accttttgtc ctacctcagt gacttgatgt atacactagg aacagtcctc agtccaagga 420 

aggtcagttg aagtccttac tgtacaagtc caaatgttaa ggaacatgag tcccgcgatg 480 

agttttctca tgcttcagcc gtgcgtggac cagtcagctt ccgggtgtga ctggagcagg 540 

gcttgtcgac ttcttcagag ttactttgca ggagttggcg aagc 584 

<210> 101 
<211> 392 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LI : 805478 . 1 :2000FEB01 
<220> 

<221> unsure 
<222> 21 

<223> a, t, c, g, or other 
<400> 101 

gccaacccct gcaaagtgac nctgaagaag acgacaagct ctgctccagt cacacccaga 60 
agctgactgg tccacgcacg gccgaagcat gagaaaactc attgcggaac tcattttcct 120 
taaaatttgg acttttacag taaggacttc aactgacctt cctcagactg aggactgttc 180 
cagtgtatac atcaagtcac tgaggtagga caaaaggttg ctacagtcct attattttat 240 
ggttattata agtgtactgg aactctaaaa ataacttgtt tgtataatgt tattctgtac 300 
aaggtatgta gcccaggaag tgaccaacct gatgtgtgtt atgacccatc tgagcttccc 3 60 
atgaccacgg tttttaaaat aagattaagg ac 392 

<210> 102 
<211> 856 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 101269 . 1 : 2000MAY19 
<400> 102 

ctctccttta ttattaacta aagagcattc tctaccattg cccagctgaa. agcacctagc 60 
tcatagattc tctgtgggga gagtgaataa agaatggggg attgaggcac agagtgttaa 120 
aatagtgaga cacaggattc taactggcag actggcactt tatctcaatt cttaactgct 180 
gtgttgcctc tcactttaaa gacatccaca ttccaccctg gccttgcccg tctggtgagc 240 
ccaatctctg ctggtcagtc acgcagttga ggtcagggaa ggagtaactc aagatgattt 300 
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tgatgttgca 
ccatggatgc 
gaattcttac 
tacggcagca 
cacattatag 
aatgtctact 
actcttgtgg 
tcgtggttga 
aatgatccag 
ttctccctag 



gttttaaatt 
taaaaatggt 
tactacaaaa 
gctatcatat 
ctgagaagag 
tcaagcacat 



actttgac 
tctaactgaa 
ctcacctgga 
tatcaatgcg 1 
aagagcaaga 
ttttggagaa 



acaccaagtg gcgccatatg 
tatttggctc tgtcttttgg 
acatcacacc ttgtgttgac 
agaccc 



ttgagctggg 
aacccaaacc 
tccctaaggg 
gacgcaaaat 
agacgattac 
tggggaagct 
tttgtgatat 
ctcatagcct 
aacgtccatt 



PCT/US01/06059 



aaatcctttg^fcctattata 3 60 
aagaaatagc aacaagtcta 420 
cacagcaaag aatgagctat 480 
acccaggcta cccgccagag 540 
ttcacaaaga tggcagctgt 600 
atgtggttga catcttcacc 660 
tttctttatc ttatattctc 720 
ttcatcatgg cgatctatta 780 
ctttcacagg ggcctttttg 840 

856 



<210> 103 
<211> 856 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> mis cofeature 
<223> Incyte ID No: 



LI:331087.1:2000MAY01 



<400> 103 

ccctctctct 

caaattctgg 

cattctgatc 

aaccctcgag 

tgacggcgac 

ggggctcctg 

acaagaagca 
gccggccatc 
ggtgcagcag 
cagcatcttc 
atgtgaccgg 
tccagaggtg 
gtgcctgggt 
gcctgtgctt 
aaac agate t 



ttctgccctg 
gcggcaacgc 
tccaagaggt 
tcaccaccac 
caggtggctg 
gtacgacaga 
ggacccgctg 
egggaggeca 
attcccaaca 
ccaccagtgc 
atgcgctccc 
egggttgagg 
ctgtgcccct 
ctegggggaa 
ttattt 



tagecgeggg 
ggcaggatct 
cccgagagag 
agtgcagccc 
ctgeaggage 
ggggecgagg 
cagccctgcg 
aeggaategt 
gtaacctcct 
tgeaggagge 
agaagctccg 
eggatcgagg 
gcagacagga 
acattcgegt 



cgtccaaatg 
ggactgette 
tgactatgta 
agcccctggt 
tggtgctgtt 
ccaaaagtgt 
accaeggagt 
ggagtgcggg 
cctcctggtg 
gacagaagtc 
ccggcgacca 
gtgggctgga 
gcatataggg 
ttatgccctg 



aagctggaat 
gtcatcgaca 
tgactatcag 
cagcccaatt 
ectgetggag 
cttccatcac 
accccgtgtt 
ccctgccaga 
acagacccca 
aaatataatg 
gactcctgcc 
ttttcatccc 
atgccaatga 
tgacactgtg 



tcctccagcg 60 
acaaegggtt 12 0 
gccatgtgca 180 
tctgccttct 240 
tggagtgtct 3 00 
tcccacaaac 3 60 
tcgtgtacca 420 
aggtatttgt 480 
cctgtgactg 540 
cctctgtcaa 600 
acgccttcca 660 
caaaccctct 720 
acacacctgt 780 
atataataag 840 
856 



<210> 104 

<211> 8530 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: LI : 410188 . 1 : 2000MAY01 
<220> 

<221> unsure 

<222> 6797, 7046, 7252, 7311, 7325-7326, 7502-7503, 7563 
<223> a, t, c, g, or other 



<400> 104 

gccgctgagc 

cagccgccca 

agagcctgtg 

agcagcttcc 

aagcaagccc 

gctccccggc 

ccaccccaag 

aagactttca 

ttgtatgtcc 

tcgctcttca 

cacgaccctc 

gagtctctgg 

gacccatgga 

gacctgggca 

tcagtcattt 



ctgcgcccag 
ccccggggcc 
cccagaagca 
gcaggttcac 
gcggctcaac 
cccagctgga 
agetcategg 
tegtactgaa 
tcagtccctt 
acatgetcat 
caccctggac 
tcaagattct 
actggctgga 
atgtctcagc 
cagggctgaa 



tgccccgagc 
eggceggggg 
ggatgagaag 
aegggagtec 
caccttgcag 
cctgcaggcc 
agagcccctg 
taaaggcaag 
ccacccagtt 
catgtgcacc 
caagtatgtc 
ggctcgagct 
ctttagtgtg 
cttacgcacc 
gaecategtg 



cccgcgccga 
accagcagct 
atggcaaact 
ctggcagcca 
gagagecgag 
tccaaaaagc 
gaggacctgg 
accatcttcc 
eggagagegg 
atcctcacca 
gagtacacct 
ttctgcctgc 
attatcatgg 
ttccgagtcc 
ggggccctga 



gccgagtccg 
tccccacagg 
tcctattacc 
tegagaageg 
aggggctgee 
tgecagatet 
accccttcta 
ggttcagtgc 
ctgtgaagat 
actgcgtgtt 
tcaccgccat 
acgcgttcac 
catacacaac 
tccgggcoct 
tccagtctgt 



cgccaagcag 
caaegtgagg 
tcggggcacc 
catggeggag 
cgaggaggag 
ctatggcaat 
tagcacccaa 
caccaacgcc 
tctggttcac 
catggcccag 
ttacaccttt 
tttccttegg 
tgaatttgtg 
gaaaactata 
gaagaagctg 



60 

120 

180 

240 

300 

3€0 

420 

480 

540 

600 

660 

720 

780 

840 

900 
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gctgatgtga tggtlJKac agtcttctgc ctcagcgtct ttg^Kcat cggcctgcag 960 
ctcttcatgg gcaacctaag gcacaagtgt gtgcgcaact tcacagcgct caacggcacc 1020 
aacggctccg tggaggccga cggcttggtc tgggaatccc tggaccttta cctcagtgat 1080 
1 ccagaaaatt acctgctcaa gaacggcacc tctgatgtgt tactgtgtgg gaacagctct 1140 
gacgctggga catgtccgga gggctaccgg tgcctaaagg caggcgagaa ccccgaccac 1200 
ggctacacca gcttcgattc ctttgcctgg gcctttcttg cactcttccg cctgatgacg 1260 
caggactgct gggagcgcct ctatcagcag accctcaggt ccgcagggaa gatctacatg 1320 
atcttcttca tgcttgtcat cttcctgggg .tccttctacc tggtgaacct gatcctggcc 1380 
gtggtcgcaa tggcctatga ggagcaaaac caagccacca tcgctgagac cgaggagaag 1440 
gaaaagcgct tccaggaggc catggaaatg ctcaagaaag aacacgaggc cctcaccatc 1500 
aggggtgtgg ataccgtgtc ccgtagctcc ttggagatgt cccctttggc cccagtaaac 1560 
agccatgaga gaagaagcaa gaggagaaaa cggatgtctt caggaactga ggagtgtggg 1620 
gaggacaggc tccccaagtc tgactcagaa gatggtccca gagcaatgaa tcatctcagc 1680 
ctcacccgtg gcctcagcag gacttctatg aagccacgtt ccagccgcgg gagcattttc 1740 
i acctttcgca ggcgagacct gggttctgaa gcagattttg cagatgatga aaacagcaca 1800 

gcgcgggaga gcgagagcca ccacacatca ctgctggtgc cctggcccct gcgccggacc 1860 
agtgcccagg gacagcccag tcccggaacc tcggctcctg gccacgccct ccatggcaaa 1920 
aagaacagca ctgtggactg caatggggtg. gtctcattac tgggggcagg cgacccagag 1980 
gccacatccc caggaagcca cctcctccgc cctgtgatgc tagagcaccc gccagacacg 2040 
accacgccat cggaggagcc aggcggcccc cagatgctga cctcccaggc tccgtgtgta 2100 
gatggcttcg aggagccagg agcacggcag cgggccctca gcgcagtcag cgtcctcaca 2160 
agcgcactgg aagagttaga ggagtctcgc cacaagtgtc caccatgctg gaaccgtctc 2220 
gcccagcgct acctgatctg ggagtgctgc ccgctgtgga tgtccatcaa gcagggagtg 2280 
aagttggtgg tcatggaccc gtttactgac ctcaccatca ctatgtgcat cgtactcaac 2340 
acactcttca tggcgctgga gcactacaac atgacaagtg aattcgagga gatgctgcag 2400 
gtcggaaacc tggtcttcac agggattttc acagcagaga tgaccttcaa gatcattgcc 2460 
ctcgacccct actactactt ccaacagggc tggaacatct tcgacagcat catcgtcatc 2520 
cttagcctca tggagctggg cctgtcccgc atgagcaact tgtcggtgct gcgctccttc 2580 
cgcctgctgc gggtcttcaa gctggccaaa tcatggccca ccctgaacac actcatcaag 2640 
, atcatcggga actcagtggg ggcactgggg aacctgacac tggtgctagc catcatcgtg 2700 
ttcatctttg ctctggttgg caagcagctc ctaggggaaa actaccgtaa caaccgaaaa 2760 
aatatctccg cgccccatga agactggccc cgctggcaca tgcacgactt cttccactct 2820 
ttcctcattg tcttccgtat cctctgtgga gagtggattg agaacatgtg ggcctgcatg 2880 
gaagttggcc aaaaatccat atgcctcatc cttttcttga cggtgatggt gctagggaac 2940 
ctggtggtgc ttaacctgtt catcgccctg ctattgaact ctttcagtgc tgacaacctc 3000 
acagccccgg aggacgatgg ggaggtgaac aacctgcagg tggccctggc ccgcatccag 3060 
i aggggcctgc gctttgtcaa gcggaccacc tgggatttct gctgtggtct cctgcggcac 3120 

cggcctcaga agcccgcagc ccttgccgcc cagggccagc tgcccagctg cattgccacc 3180 
ccctactccc cgccaccccc agagacggag aaggtgcctc ccacccgcaa ggaaacacag 3240 
tttgaggaag gcgagcaacc aggccagggc acccccgggg atccagagcc cgtgtgtgtg 3300 
cccatcgctg tggccgagtc agacacagat gaccaagaag aggatgagga gaacagcctg 3360 
ggcacggagg aggagtccag caagcagcag gaatcccagc ctgtgtccgg ctggcccaga 3420 
ggccctccgg attccaggac ctggagccag gtgtcagcga ctgcctcctc tgaggccgag 3480 
gccagtgcat ctcaggccga ctggcggcag cagtggaaag cggaacccca ggccccaggg 3540 
tgcggtgaga ccccagagga cagttgctcc gagggcagca cagcagacat gaccaacacc 3600 
gctgagctcc tggagcagat ccctgacctc ggccaggatg tcaaggaccc agaggactgc 3660 
ttcactgaag gctgtgtccg gcgctgtccc tgctgtgcgg tggacaccac acaggcccca 3720 
gggaaggtct ggtggcggtt gcgcaagacc tgctaccaca tcgtggagca cagctggttc 3780 
gagacattca tcatcttcat gatcctactc agcagtggag cgctggcctt cgaggacatc 3840 
tacctagagg agcggaagac catcaaggtt ctgcttgagt atgccgacaa gatgttcaca 3900 
tatgtcttcg tgctggagat gctgctcaag tgggtggcct acggcttcaa gaagtacttc 3960 
accaatgcct ggtgctggct cgacttcctc atcgtagacg tctctctggt cagcctggtg 4020 
gccaacaccc tgggctttgc cgagatgggc cccatcaagt cactgcggac gctgcgtgca 4080 
ctccgtcctc tgagagctct gtcacgattt gagggcatga gggtggtggt caatgccctg 4140 
gtgggcgcca tcccgtccat catgaacgtc ctcctcgtct gcctcatctt ctggctcatc 4200 
ttcagcatca tgggcgtgaa cctctttgcg gggaagtttg ggaggtgcat caactatacc 4260 
gatggagagt tttcccttgt acctttgtcg attgtgaata acaagtctga ctgcaagatt 4320 
caaaactcca ctggcagctt cttctgggtc aatgtgaaag tcaactttga taatgttgca 4380 
atgggttacc ttgcacttct gcaggtggca acctttaaag gctggatgga cattatgtat 4440 
gcagctgttg attcccggga ggtcaacatg caacccaagt gggaggacaa cgtgtacatg 4500 
tatttgtact ttgtcatctt catcattttt ggaggcttct tcacactgaa tctctttgtt 4560 
ggggtcataa ttgacaactt caatcaacag aaaaaaaagt tagggggcca ggacatcttc 4620 
atgacagagg agcagaagaa atactacaat gccatgaaga agttgggctc caagaagccc 4680 
cagaagccca tcccacggcc cctgaacaag ttccagggtt ttgtctttga catcgtgacc 4740 
agacaagctt ttgacatcac catcatggtc ctcatctgcc taaacatggt tcaccatgat 4800 
ggtggagact gatgaccaaa gtgaagaaaa gacgaaaatt ctgggcaaaa tcaaccagtt 4860 
ctttgtggcc gtcttcacag gcgaatgtgt catgaagatg ttcgctttga ggcagtacta 4920 
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cttcacaaat ggctggaatg ^^fcttgactt 
gattttttct gcaattctta agtcacttca 
catccgcctg gcccgaattg gccgcatcct 
cacactgctc tttgccctca tgatgtccct 
attccttgtc atgttcatct actccatctt 
ggaggc.tggc atcgacgaca tgttcaactt 
cttccagatt accacgtcgg ccggctggga 
gcccccctac tgtgacccca atctgcccaa 
cccagccgta ggcatcatct tcttcaccac 
caacatgtac attgcagtga ttctggagaa 
gcctctgagt gaggacgact ttgacatgtt 
ggccactcag tttattacct tttctgctct 
cctgagaatc ccaaaaccca atcgaaatat 
tggagataag atccactgct tggacatcct 
atccggggag ttggattctc tgaaggcaaa 
atccaagatc tcctacgagc ccatcaccac 
atccagagag ccttccgcag 
ttccgtcagc aggcgggcag 
atcgcctacg tgatgagtga 
atctcctcca cttccttccc 
ctccaggtgc gggggtctga 
ccggacaggg accgtgagtc 
gacacactga aaagcagcct ttttcaccat 
agcctggggc cttcctggct ttgggagtaa 
tgtggcgccg cgtggacagc 
gtgacctggt ctggtcaggc 
gctgaggcga aatataaaac 
tatatttgat atttttttac 
caattcacgc tgtctcttct 
tgttctcaaa gcagaagtgg 
cgaatggggt ccccctccca 
cccactcaca caagcanaca cacacagtcc 
gacccaggct cccagcctaa gggagacagg 
ttcttgtcca cggggctcac tctggccccc 
tgtgttttca cgcaggtcat attgtcagtc 
tggcctgggg tcccagggct gggccntagg 
aacaaaacca aaggacacaa 
ttgctcctgg aacctggcag 
ggwgtttgg ggctaagaag 
agaggaaact 



PCT7US01/06059 



ggccatggtt 
ctccttcctc 
agagggcctc 
cagctcctcc 
cagcgataac 
ccccccttct 



ggcagagttc 
agatagacct 
catcccagct 
ataaatttat 
acatgggcag 
ccggaagtcc 
gcccaggggt 



taagagtatt 
cagcttgtcc 
get gage cat 
ctgcctgaag 



tttcnngtgt ggtctccagc 
ttggccctgg ccctgtgcag 



ggcgatctcc 
tcctctccag 
ggaaagggat 
t tgagaga gg 
ggcacccaca 
aaaggctttt 
gctgacaatc 
aacagggtca 
tcccacggag 
ctctgggctg 



gggaatgaac catggcacct 
cnnagtcccg ggatcctgat 
ggntcccaba ggtcttgtga 
cccccaggac ctcttcagct 
gagaactgtc accctgaggt 
tggctgcccc attccagctc 
acctcagtct cacctgccgg 
ggagggctgg acatttgtgg cagtgctgaa 
tttcttcttg tcaccccagt tcccttggtg 
cagatctagt tctcttctcc tgtgttccaa 
ctgtccaagc agagatttct ccaagtggaa 
gctctcaagg cttggagagg tatgtcccac 
ggcaggtaat gcggattgga ggcaggtegg 
gtggtttttg tgtgtgtgga cagcccggga 
cagacggtgt ggttgggggt gggaggtgag 
tgtcaccttt ttaatttcca gaactgeaca 
aaaaaggagg gaaggaaaga catcaagtgc 
tcaccaaact tcatgtataa ataaaataca 
catgacctgg 

<210> 105 

<211> 2965 

<212> DNA 

<213> Homo sapiens 



cattgtggtg gttctctcca'^^gcgagcct 4980 
aagttacttc tccccaacgc tcttcagagt 5040 
cagactgatc cgagcggcca aggggatccg 5100 . 
gcctgccctc ttcaacatcg ggctgttgct 5160 
eggtatgtec agctttcccc atgtgaggtg, 5220 
ccagaccttc gccaacagca tgctgtgcct 5280 
tggcctcctc agccccatcc tcaacacagg 5340 
cagcaatggc accagagggg actgtgggag 5400 
ctacatcatc atctccttcc tcatcgtggt 5460 
cttcaatgtg gecaeggagg agagcactga 5520 
ctatgagacc tgggagaagt ttgacccaga 5580 
cteggacttt gcagacactc tctctggtcc 5640 
actgatccag atggacctgc ctttggtccc 5700 
ttttgettte accaagaatg tcctaggaga 5760 
tatggaggag aagttcatgg cagccaaccc 5820 
cacactccgg cgcaagcacg aagaggtgtc 5880 
gcacctgctg caacgctctt tgaagcatgc 5940 
cggcctctcc gaagaggatg cccctgagcg 6000 
gaacttctcc cgaccccttg gcccaccctc 6060 
accctcctat gacagtgtca ctagagccac 6120 
ctacagccac agtgaagatc tcgccgactt 6180 
catcgtgtga gcctcggcct ggctggccag 6240 
ggcaaaccta aatgeagtea gtcacaaacc 63 00 
gaaatgggcc tcggccccgc ggatcaacca 63 60 
eggagcagtt ggcctgtgct tggaggcetc 6420 
aatgcccctg cggctctgga aaagcaactt 6480 
tgagactgta tatgttgtga atgggctttc 6540 
ttgagcaaag aactaaggat ttttccatgg 6600 
taaccctgaa caagagtgtc tatggagcag 6660 
aatccagtgt ggctcccaca ggtcttcact 6720 
cttgacctga gatgctggga gggctgaacc 6780 
tcacacacgg aggecagaca caggccgtgg 6840 
cctttccctg ccggcccccc aaggatgggg 6900 
tattgtctcc caaggtccca ttttccccct 6960 
ctacaaaaat aaaaggcttc cagaggagag 7020 
cactgatagt tgecttttet tcccctcctg 7080 
gggtgcaagc cccattcacg gcctggcatg 7140 
gccctgccag ccagccaatg gaagagaggg 7200 
ttcaccagcc ctgagccatg gnccctcagc 7260 
cagggctctc tggaccatac neggaggagt 732 0 
acacagagac atgggagtgg ggagcggacg 7380 
ggtcaggccc agttctcgtg ccccttagag 7440 
gggcactgtg gtcaggccca gcctctctgg 7500 
cagaggacct ctttggggca agatccaggt 7560 
tcagggaaaa a tat tt tact agtccaatca 7620 
ctatttagca tatgeaaate ttttaacata 7680 
actggcgaag ctgaggcagg ccagggggct 7740 
cccctccaac egggegcatg ctcccaggcc 7800 
gctgctccta acctacctcg ccgagctgtc 7860 
gggggcattg ccggcgagta aagtattatg 7 920 
gcaaccccag acccaaaccc atgcccctga 7 980 
tttgagtcca agtgtgggac aaggttttaa 8040 
atccatattt ttgtgagatc tecatgettt 8100 
ctccatgggt tgctcaccgc ctgctacaca 8160 
gctgccagcc cagctggccg gaaggagact 8220 
gctttgagac aggtgcctgg gggctggctg 8280 
ctagacccaa cccttagctt ttagcctggc 8340 
atgaccagca ggaggggaga agagagtagg 8400 
cagatgttgt ctgaactaat cgagcacttc 8460 
tatttttaaa acaaaccaat aaatggctta 8520 

8530 



49/228 



BNSDOCIP: <WO 01 62927A2 I > 
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<220> 

<221> misc_f eature 

<223> Incyte ID No: LI : 1188288 . 1 : 2000MAY01 
<400> 105 

ctgaggtggc gtgggcgccc tgcgggccgg gcctccgcgg cgcgtccagt cccagtctcc 60 
gcgctggtct, ggcccggtcc ctcagctcca ccccctgcct ccccctcccg cccgctcgcg 120 
cctcctgaaa agccagctcg cccggggcgg cggcgcagag ccgggccggc gcacgaggca 180 
gccagcgcgg ccatgacggc ggagaaggcg ctgcccctgg gcaatgggaa ggctgctgag 240 
gaggcgcggg agtccgaggc gccgggtggc ggctgcagca gcgggggcgc ggcgcccgcg 300 
cgccacccgc gcgtcaagcg cgacaaggcg gtccacgagc gcggccactg gaacaacaag 360 
gtggagttcg tgctgagcgt ggccggggag atcattgggc ctgggcaacg tgtggcgctt 420 
cccctacctt gtgctacaag aacggaggag gggcattcct gattccctac gtggtgtttt 480 
ttatttgctg tggaattcct gtttttttcc tggagacagc tctggggcag ttcacaagtg 540 
aaggtggcat tacgtgttgg aggaaagttt gccctttatt tgaajggcatt ggctatgcaa 600 
cacaggtgat tgaggcccat ctgaatgtgt actacatcat catcctggca tgggccattt 660 
tttacctgag caactgcttc actactgagc taccctgggc tacctgtggg catgagtgga 720 
acacagagaa ttgtgtggag ttccagaaac tgaatgtgag caactacagc catgtgtctc 780 
tgcagaaatg ccacctcccc tgtcatggag ttttgggagc accgggtcct gggcatctct 840 
gacgggatcg agcacatcgg gaaccttcgc tgggagctgg ccttgtgtct cttggcagcc 900 
tggaccatct gttacttctg tatctggaag gggaccaagt ctacaggaaa ggttgtatac 960 
gtgactgcga cattccccta catcatgctg ctgatcctcc tgatacgagg ggtcacgttg 1020 
cccggggcct cagagggcat caagttctac ttgtaccctg acctctcccg gctctccgac 1080 
ccccaggtct gggtagatgc tggaacgcag atctttttct ' cctatgccat ttgcctgggc 1140 
tgtctgaccg ctctggfcraag ttataacaat tataacaaca actgctacag ggactgcatc 1200 
atgctctgtt gcctgaacag cggcaccagc tttcgtggct gggtttgcca tcttctcagt 1260 
cctgggtttt atggcgtacg agcagggggt acccattgct gaggtggcag agtcaggccc 1320 
cggtctggcc tttattgcgt accc'caaggc ggtcaccatg atgcccctct ccccgctgtg 1380 
ggccaccttg ttcttcatga tgctcatctt cctgggcctg gacagccagt ttgtgtgtgt 1440 
ggaaagcctg gtgaccgccg tggtggacat gtaccccaat ggttttcccg gaggggttac 1500 
cggcgggagc tgttcatcct agccttgtct gttatctcct attttctggg cctcgtgatg 1560 
ttaacagagg gtggcatgta catcttccag ctctttgact cctatgccgc cagtgggatg 1620 
tgccttctct tcagcggcca tctttgagtg catctgcatc ggctgggtgt atgggaagca 1680 
accggttcta tgataacatt gaagacatga ttggctaccg gccaccgtcg ctcattaagt 1740 
ggtgctggat gatcatgacc cctgggatct gacgacgggg atcttcatgc ttgcttcttg 1800 
atcaaggtac aagaccactg caaggtacaa caacatctac acctacccag cctggggcta 1860 
atggcattgg ctggctcatg ggccctgtcc tccatgctct gcatcccgct ctggatctgc 1920 
atcacagtgt ggaagtacgg aggggacgct gcccgagaaa ctccagaagt tgacgacccc 1980 
cagcacagat ctgaaaatgt cggggcaaga cttggggtga gcccacggat ggtgacagtt 2040 
aatgactgtg atgccaaact caagagtgac gggaccatcg cagccatcac agagacagga 2100 
gaacgccact tctgagcggg ccaaaccaaa gccatcttgg cggctccttc ttcctttctt 2160 
ccccccgtgt atgtaaatga attcctgaac cccatacttc acctaatggt aggggcttgc 2220 
ttgttttgca caggattaat taacaagtta attttaaggt ggccactgta cactgctcta 2280 
aagtcatatc ccctccccgc cccccagttc atcatggaag taaccactac aaagagataa 2340 
cactgtacgg ggactgccag atcttctgtc ctagggcagt tttaatcaat gttttctagg 2400 
gattaggaag aagtgtataa tattgtaaaa ctttgtatta ctgggggttg tgggccaggt 2460 
ggggtgggga gtgatggctg tcctcctcgt gtcggccttt gtaatgaccg ttctgtgtcc 2520 
cagtgtatgt ctctgaccat ccacagccgc ccctcctcca tacccccggg ataaattttc 2580 
cctcaaaccc tccccttcca caagggacac tggggttggc agacagcaag gctattgtgg 2640 
cagagaagaa aaagtccagc agactcagga ctcctgggct gcaggccccc ctctggcctg 2700 
atcccatgaa ggtgaccttg ggaactccct ttccctccct gggcctcagt ttccccctct 2760 
ccccaatgag aggaggagcc cgattggggt atttcttagg gatatctctt tgtgtgccta 2820 
tgattgcttc cagacctgcc gtggaccttg gcttccttct gaagacttgc aggccattat 2880 
ctccgccctc cccagttgta cctcctcccc acctgcttat ttaaaatcca gtttcttgtc 2940 
gaaacacaca caaaacacac acaaa 2965 

<210> 106 

<211> 2376 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LI : 427997 . 4 : 2000MAY01 
<400> 106 

cgcgtacagg aggcggcggc ggctcccagt caccggcccc cgacggcgag cgcacgatgc 60 



50/228 



BNSDOCID: <WO 01 62927 A2J_> 



WO 01/62927 



actgcctggg 
gaccccagat 
ggactggttt 
tctttggcta 
cccctctttc, 
acctggaagt 
aagagcagct 
tgtgtggtta 
gtcccaggct 
aaacacctga 
accgaattct 
cattagcctt 
tcacattttc 
ggctgtatta 
atgaagtgcg 
gtcttgatat 
ccaaataaag 
ggacgatttg 
cttgggaact 
ctttcgggca 
gaattcatcc 
caacagaagt 
tcttcttcta 
attccgagaa 
gcattcgtac 
aaatacctcg 
cagatttcgt 
tttacatcta 
gtgtttcact 
tggggcagtc 
tttatctgtg 
aacaaatgat 
aaaaaggagc 
gaatgaggag 
tgtacctggg 
actagagata 
ctcattgtac 
tacctttttt 
aacttacaaa 



cgccgagtac 
ttataggaaa 
ggtgtttatc 
tgacagcttt 
agggcaggac 
caaaggtacg 
tgactccctg 
tagtttgaat 
accagttcct 
gtgctactcc 
aagtggaatc 
gtctttggcc 
atttcattgg 
tgactatacc 
tgctggggta 
tttgttctca 
ccaatcagca 
gccatcctca 
gcagggagct 
ttccggtaac 
ttgcgtgcca 
aaaaaatgat 
ccatccaagg 
tcattgtcat 
caggtacctg 
ctccatctca 
gtacatcagc 
ttaatctgct 
gttttttgga 
cctctgttat 
tttgaaactg 
ggatcgtcag 
aacaaattaa 
gggaacagaa 
aaaacatttc 
agttrtgtgaa 
tttgtcatta 
tgtacaaatt 
gtacactatg 



ctggtttctg 
tgcacagata 
atgggctact 
ggcaacatgt 
atgaccctaa 
cagctcaacc 
gaagaggtcc 
tccttcaact 
ccaagcaagt 
ctatttgcat 
atgtcgggaa 
atgatgttga 
ttattttggg 
aacgacctca 
tgctatcgta 
gaaagagaat 
gttggctccc 
ttgttcttcg 
gcccaggtta 
atgtgggtcg 
gcaaatgact 
cctccgtgat 
aaccattgtg 
gtacatggca 
ttccggatga 
agccagaatg 
aaaagatgca 
ttggagactt 
ggactcatgg 
tggtagcttt 
tgctggatgc 
aaaagcccta 
acaatgcaag 
ctcccaggcc 
cttgctaaga 
tttttttttc 
ttgttcJgaac 
aggacagaaa 
taagaactga 



ccgacggcgc 
cggcatggtt 
cggtggtggc 
gtggcaagaa 
aaaaacacgt 
gcatggccct 
agttctttgc 
atacccacag 
catttccctt 
ctgttttgat 
gagatacaat 
ccgttcagat 
attgttgttt 
gcatagaatt 
atccacaggg 
aaaattgaca 
ttcctgggct 
tgggtcctct 
tgggaaggcg 
taccatttaa 
atatgctggg 
catcccagcc 
aaagggtcat 
acaacgcagc 
ctgcttacgg 
catatactac 
ttcaaaatct 
cataattttt 
cttttaacta 
ttttggccta 
acttttcctg 
ctttatggat 
ggccacagca 
attgtgcaga 
ggccatttac 
tcaaaagacc 
caggataaca 
aactcttcta 
ggtaag 



PCT/US01/06059 



ccctaggcaa agggagtggc 120 
attcctgttc tttctctttt 180 
tggagccgcg ggaagactcc 240 
gaactccccc gtggaagggg 3 00 
gttctttatg aattcctgca 3 60 
ctgtgtatcc aactgccctg 420 
aaacaccagt gggtccttcc 480 
tccaaaagca gactcactgt 540 
atttaaccga tgtgtccctc 600 
aaatgatgtt gacaccctcc 660 
ccttggcctg tgtatcctcg 720 
tcatacacca cccttctggt 780 
agtctgcggt gttttatggt 840 
ggacacagaa cagggaaaat 900 
catcacggca gatgctgctc ,960 
agttgagctt ttccaaatca 1020 
ggttccaagc caactgggtg 1080 
gggtggctgt gctgctgagc 1140 
gccaagtgga atataagccc 1200 
ttggcctcat ctggactagt 1260 
gcagtggtta cttgttattt 1320 
gtttcgtctc gtccccattc 1380 
ttttaatctc tgtggtgagg 1440 
tgacaagata cagcatcggt 1500 
ctgttcctgg tgtctggaca 1560 
aatcggccta ttacatggga 1620 
tgtccaagaa ctcaagtcac 1680 
ctaggaaagg tgttagtgct 1740 
caatcgggca tttccaggtg 1800 
cttagttagc ccatagtttt 1860 
tgttttgctg ttgatctgga 1920 
caagaatttc tgagtttcgt 1980 
gggacaagca ctccattaag 2040 
tagataccca tttaggtatc 2100 
agaatagaag atgagtatcc 2160 
taataaaacc ctattcatca 2220 
atactggaac tatattagtt 2280 
aaaccatgtt tatatgcatc 2340 

2376 



<210> 107 
<211> 821 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: IiG:451682 . 1 :2000FEB18 



<400> 107 

cccatctcgc 

cggaggggcg 

ccctcgggat 

agctcctcca 

gcgctgtggc 

cccagcgcta 

tctgcgacac 

tcctattcgc 

gcaactacgg 

tgctcaagca 

aggtcctcag 

aggtatttct 

tgwgaagct 

gtttgcagtg 

<210> 108 
<211> 435 



cgccgccgcg 
tctctaccag 
cctggcggcc 
gacctcccgc 
agggatcatg 
cgcgctctcc 
caagcagggt 
cggttgggac 
cgggtggaag 
ggactacaag 
caagaccatg 
gcagcccgac 
gcttgcaaac 
gcatgtgatg 



atgtctcgcc 
gtggagtacg 
gacggcgtag 
tccgccgaga 
tccgacgcca 
taccaggagc 
tacacccagt 
aagcaccacg 
gccgccgctg 
gacgccttga 
gactc caeca 
aceggggagg 
tccggcctga 
ccttctactg 



gctatgacag 
cgatggaggc 
tectggtegg 
agatgtacaa 
acatcctcat 
ccatccccgt 
ttggtggcct 
gtttccagct 
ttggggccaa 
cccgcgagga 
gettgactge 
tacagtacca 
cacagccaac 
ctcttagaca 



tcgcacgacg 
gateggcaac 
cgagaagaag 
gatcgactcc 
caacaccgcc 
cgagcagctc 
ccgccccttt 
ctacatgagc 
cagccaggcc 
ggccgtcggt 
cgagaagctg 
ggtgtgctct 
acccgaggca 
t 



atcttctcgc 60 
geegggtcag 120 
gtgacctcca 180 
cacctggcgt 240 
cgcctccacg 300 
gtccagtccc 360 
ggggtctcct 420 
gacccatctg 480 
gcgcagtcca 540 
ctcgcgctca 600 
gagctggccg 6€0 
cctgaggcgc 720 
taatgtactg 780 
821 



51/228 



BNSDOCID: <WO 0162927A2J_> 
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<212> DNA 
<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LG: 1077283 . 1 :2000FEB18 
<400> 108 

cgctgtctct tttcgccgct tgctgcagac atggtcaacc ccaccgtgtt cttcgacatc 6,0 
acggctgatg gcgagccttt gggtcgcggt ctggcttcgg aagtggttgg cagacaaagt 120 
tccaaagaca gcagaaaact ttcgtgctct gagcactggg gagaaaggat ttggctataa 180 
cgggttcctg ctttcacaga attattccag gattcatgtg ccagggtggt gacttcacac 240 
gccataatgg cactggtggc aagtccatct acggagcaga aatttgcagg atgagaactt 300 
catcctgaag catacaggtc ctggcatctt gtccatggca aatgct'ggac caaacacaaa 360 
tggttcccag ttttttatct gcactgccaa gactgagtgg ctggatggca agcatgtggt 420 
ctttgggaag gtgaa 435 

<210> 109 

<211> 1430 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No; LG:481436 . 5 :2000FEB18 
<400> 109 

ggcagatttt ctaaacatag aaagggagtt cacatgctac atggtcaaaa ataaaactga 60 
tacatgtcca tttcagtatt caataaatat ttgtcagatg agtgaatgag agaatcagag 120 
tcatgatttc ctgtctctag acctttaagt caccaaacat tgaagtgata atggtgttgc 180 
taatgtaatt gaaaataaaa tcgaaggagc aagctatgga ggggaacagg * gatgaagctg 240 
agaaatgtgt cgacatcgtc caggaggctc tgaatgctgg caaccgcgag aaggcccagc 300 
gtttcctgca gaaggccgag aagctctacc cactgccctt ggcctgcgca taaacaaata 360 
tcaaaattac tatgaaagta cttggagtta cgaaagatag tggtgatgaa gatttgaaaa 420 
aagcttatag aaagcttgct ttgaagtttc atccagacaa aaaccatgca cctggagcaa 480 
cagatgcttt taaaaagatt ggaaatgctt atgctgtttt aagtaatcca gaaaagcgaa 540 
aacagtatga cctcacgggc aatgaagaac aagcatgtaa ccaccaaaac aatggcagat 600 
ttaatttcca tagaggttgt gaagctgata taactccaga agacttgttt aatatatttt 660 
ttgggggtgg atttccttca ggtagtgtac attctttttc aaatggaaga gctggttata 720 
gccaacaaca tcagcatcga catagtggac atgaaagaga agaggaaaga ggagatggag 780 
gtttttctgt gtttatccag ctgatgccca taattgtatt gatcctcgtg tcattattaa 840 
gccagttgat ggtctctaat cctccttatt ccttatatcc cagatctgga actgggcaaa 900 
ctattaaaat gcaaacagaa aacttgggtg ttgtttatta tgtcaacaag gacttcaaaa 960 
atgaatataa aggaatgtta ttacaaaagg tagaaaagag tgtggaggaa gattatgtga 1020 
ctaatattcg aaataactgc tggaaagaaa gacaacaaaa aacagatatg cagtatgcag 1080 
caaaagtata ccgtgatgat cgactccgaa ggaaggcaga tgccttgagc atggacaact 1140 
gtaaagaatt agagcggctt accagtcttt ataaaggagg atgaactgga atttttattt 1200 
atacctttta gcgtactctt tattttttct gtaagtaagt ttggtttcat catgagggat 1260 
gaaggaaaag atttgatact gaaaactaaa ctgaatagtt ggttcctgaa atcttggact 1320 
gtttatgacc tactggctcc tttaaatagt aactgaaaac taaaatggaa tattttagtt 1380 
aacgcttcta caagtatttt cattttaaaa gcttacatga ttcctaacta 1430 

<210> 110 
<211> 889 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 7937 01 . 1 :2000FEB01 
<220> 

<221> unsure 

<222> 730, 733-735, 797-798 
<223> a, t, c, g, or other 

<400> 110 
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ccgacagcat 
cagcctcctt 
gagtgtggat 
ctgtcgtcct 
acaaccgtgt 
cctgcagagt 
tggtccacgc 
tggacttata 
attcatcagg 
tatgaatgtg 
gtagcccgag 
ccgtttttga 
ctagaacagn 
ccattaatag 
gagtagaaaa 



aatgtgaggc 
tgaagatctc 
caaggactgg 
gatcactccc 
aaaacctgca 
gaccctgaag 
acagccgaag 
cagtaagggc 
tcaccgaagt 
tggaaacaat 
aaatgaccga 
aataagaata 
cannnaagaa 
taaaaannta 
taaatatgtt 



aaggtctagc 
caggtggact 
aacgtagcct 
actgctgtga 
gcgcctgaat 
aagatgacaa 
catgaggaag 
ttcaactgat 
aggacagcaa 
aaaaaaactt 
cctgatgcgt 
agaactggcc 
attcccaagc 
gaaataggat 
tgtcatgagt 



tgttcccccc 
tcacagagat 
ctttgtgccc 
acgtagagag 
cctgggaggc 
gccctgctcc 
ctcattgtga 
cttactcaaa 
attaaaacaa 
gtttgtataa 
gttataaccc 
ttttcctagg 
aaataacttt 
gtggttctct 
caggggtttg 



cggcatacaa 
gccaaagtgt 
atggtggaaa 
aatcctagcc 
aagaccaagt 
agtcacaccc 
gattcatttt 
ctggggactg 
tctttctgtt 
tgccactcag 
atctgagccc 
tgatacaagt 
aagatttgat 
taactgagaa 
taaaaattg 



gcttatagag 60 
ggaggtgttc 120 
ggaccccaga 180 
tggatccatc 240 
ctggacaacc 300 
agaagctgac 360 
tcttaaattt 42 0 
ttcccagtgt 480 
ctatagttat 540 
tacaaggtat 600 
tccgcaacca 660 
aaaataataa 720 
gcttgtgcag 780 
aggagctaaa 840 
889 



<210> 111 
<211> 1106 
<212> DNA 

<213* Homo sapiens 
<220> 

<221> misc_f eature 
<223> Incyte ID No: 



1,1:373637. 1:2000FEB01 



<220> 

<221> unsure 
<222> 974 
,<223> a, t, c, 



g, or other 



<400> 111 

gccatccagc 

ggccatcaag 

gaacaaggag 

ggacgccaag 

cacaggcggc 

cgacgtcttc 

cccgctggag 

ctgcacccct 

ttgatacagg 

tgtcctacgg 

ttgatggtaa 

tggaggaaga 

ccgcattgac 

cggacttttt 

ataaacactt 

ttgtctttca 

gaggcagaca 

attacctttt 

attgtttgtg 



atggtggact 
aaggcgtacc 
gaagcggaga 
aaacgcgata 
aggcccttcg 
agggagttct 
aatattttgg 
tttctctgcc 
atttcgttcc 
tagtgatggg 
aaaaatcacc 
tggagagttg 
accaagtaat 
ttaggattga 
catgccaaca 
atcgttgtaa 
atgnttattt 
tccgttggga 
tgtagtggtg 



actacgaggt 
gcaagctggc 
ggagattcaa 
tctatgaccg 
aggacccctt 
tcggcggcca 

gggggtcaga 

ttcagtgaat 
tttggctccc 
acaggcagct 
accaagagaa 
agttaaagtc 
tccagtccac 
aagtgaactt 
cctgttctta 
aatatctgta 
gctagtggta 
aagtgtgagc 
aaatct 



gctggacgtg 
gctcaagtgg 
gcaggtggcc 
ctatggcgag 
cgagtacgtc 
ggacccattc 
ggaactgctg 
ttccagcttt 
tgggaagtgg 
tcaagtccat 
tcattgagaa 
cttcataata 
atttgacttt 
tactttcaga 
tcgttgggac 
tgcaatttgc 
ggacagaatg 
caggtggttt 



ccccggcagg 
caccccgaca 
gaggcc tacg 
gcgggggcgg 
ttcagcttcc 
tcctttgacc 
gggaagcaga 
tgggggtgtt 
gggcctttct 
gtcgacttcc 
tggccaagaa 
aatagcaaag 
aagcacatct 
acagctgtac 
tacacggata 
tatttattaa 
ttcagttcag 
gctctaaatc 



cctcatccga 
aaaaccctga 
aggtgttgtc 
agggcggctg 
gcgacccagc 
tcttgggaaa 
agcagagcgt 
ttttcttctt 
tccttctgca 
actgaaatag 
agggtggaag 
agcagttact 
ggaggaatag 
ccaagaattt 
ggacctctgt 
acttaagcct 
cactttgagc 
atttgacttt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1106 



<210> 112 
<211> 461 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: LG: 23 93 68 . 2 : 200 0MAY19 



<400> 112 

tcgactccga 

acgcccagcc 

tagatttggg 

ttttcacctc 

gatctccagg 

gaccttcact 



cctcgtgatc 
cctgcaagca 
acaagttgcc 
cgggatcccc 
aagtgcctct 
agctcctggt 



tgccccctcg 
gtttctttaa 
cagctctgct 
agggcttcct 
gtgcctcctc 
aactccgatg 



gcctcccaat 
ctctgttggg 
aggaagagtc 
gggttgactc 
tgtctggtcc 
tatttcttca 



gctggatacg 
atgccattcg 
agtcctcagg 
aggacttcat 
aggttgtggt 
tttgttgccc 



gcatgagcac 60 
tgggcaaagc 12 0 
gggaagtttc 180 
acgcagcctg 240 
ctgggtgcca 3 00 
cttctgagag 360 



53/228 
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WO 01/62927 PCT/US01/06059 
j^Kgcca gctgacgctt ctcatcctga aaactgt 



gcccaaaacc tggtaagcca gctgacgctt ctcatcctga aaaccgtgaa caaactcata 420 
gagcttcgcc cactcctgaa agcagctgct gttgaagcca g 461 

<210> 113 

<211> 1450 

<212> DNA 

<213> Homo, sapiens 

<220> ' , 

<221> misc_feature 

<223> Incyte ID No: LI : 053826 . 1 : 2000MAY01 

<400> 113 ; ' - 

gaattccccc cccccccccc tcacttggtg tgtctatatg tctggcagac attatcagca 60 
cattctctgt tgttacctgt gattcatttt ttcttcactc tccaggtgaa tttcaattgc 120 
tgaaaatttc ccactgaaaa tatgcagtaa tatattttgt ggttcagaca tttggggcaa 180 
atggttcaca ttcattttag ggttagtggt catgctgttt atttttctct gctatacaaa . 240 
gttcctctta ggggtctgcc tcatgacact aaaaaatgaa tagagattct actgtaggtt 3 00 
atctcctagg cttgagttca acatttgttt ggatttttga agaaagtcaa atcaagcaat 3 60 , 
gctcccaaat gatgtctttg taaattcata ccctctggcc ctattttttt tcatagaccc 420 
taactctacc. tttctgcttt aaagcaaagt aaactcggtg gcctcttctt ctccacccct 480 
caaaatgata gcaatctctg ccgtcagcag tgcactcctg ttctcccttc tctgtgaagc 540 
aagtaccgtc gtcctactca attccactga ctcatccccg ccaaccaata atttcactga 600 
tattgaagca gctctgaaag cacaattaga ttcagcggat atccccaaag ccaggcggaa 660 
gcgctacatt tcgcagaatg acatgatcgc cattcttgat tatcataatc cagttccggg 720 
ccaaggggtc ccccccggag gcaatattgc atattttggt tggggttgaa atcctgtgcc 780 
aatctcggca aaggcttggg cggctacttg catttgggac catggacctt cttacttact 840 
gagatttttg ggccaaaatc tatctgtacg cactggaagg taggaaatat cgctctattc 900 
tccagttggt caagccatgg tatgatgaag tgaaagatta tgcttttcca tatccccagg 960 
attgcaaccc cagatgtcct atgagatgtt ttggtcccat gtgcacacat tatacgcaga 1020 
tggtttgggc cacttccaat cggataggat gcgcaattca tacttgccaa aacatgaatg 1080 
tttggggatc tgtgtggcga cgtgcagttt acttggtatg caactatgcc ccaaagggca 1140 
attggattgg agaagcacca tataaagtag gggtaccatg ttcatcttgt cctccaagtt 1200 
atgggggatc ttgtactgac aatctgtgtt ttccaggagt tacgtcaaac tacctgtact 1260 
ggtttaaata agtttacctt ttcctccagg aaatataatg atttctggga acatgggcat 1320 
gtatatatat atatggagag agaattttgc acatattata catattttgt gctaatcttg 1380 
ttttcctctt agtattcctt tgtataaatt agtgtttgtc tagcatgttt gtttaatcct 1440 
ttgggaattc % I . 1450 

<210> 114 
<211> 793 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 449393 . 1 : 2000MAY01 
<400> 114 

cagccatgtc gctccgcgtg ctgaacccta acgccgaggt gctcaacaag tcggcagcgc 60 
tgcacatgaa catcaacgcc gccaagggcc tccaggacgt gctcaagacc aacctcggcc 120 
ccaagggcac catcaagatg cttgtgggtg gagctggtga cctgaagctg acgaaggatg 180 
ggaacaccct cttgaaggaa atgcaaattc agaacccaac tgcaatcatg attgcgagga 240 
cggctgtggc acaggatgac acaagtggtg atggcacaac ctccactgtg ctatttatcg 300 
gggagctaat gaagcagtct gagcggtgca tcgatgaagg aactcatcca agattcttgg 3 60 
ttgatggttt tgatgttgct aagagagcgt gtcttgattt tctcgataag ttcaagacgc 420 
cagtcgttac tggtgaagaa cctgacagag ataccttaaa aatggtagca agaacaacac 480 
ttaggacaaa gttgtatgaa gggttggctg atcagttgac tgatattgtt gtgaatgcag 540 
ttctgtgcat tcgcaaacca gatgaaccaa ttgatctatt tatggtggaa ataatgcaca 600 
tgcgccacaa atttgatgtt gacacccgtc tggttgaggg tctggtcctg gaccacggtt 660 
ctcgtcaccc tgacatgaag cgcagggcag agaattgtta catcttgacc tgcaatgtct 720 
cattagaata tgagaaaagt gaaatcaatg caggattttt ttactcaaac gcagaacaaa 780 
aaaaaaaaaa agg 793 

<210> 115 
<211> 2494 
<212> DNA 
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<213> Homo sapiens 
<220> 

<221> misc__f eature 1 

<223> Incyte ID No: LI : 1071427 . 96 : 2000MAY01 

<400> 115 

atgaaagtca ggcttttatt gagatgagac aagagaagat gcaatggcaa tggttgacca 60 
ttgtttgaaa aaagcccttt ggtttcaggg gagatgtgtg aaggttgacc tgtctgagaa 12 0 
atataaaaaa ctggttctga ggattccaaa cagaggcatt gatttactga aaaaagataa 180 
atcccgaaaa agatcttact ctccagatgg caaagaatct ccaagtgata agaaatccaa 240 
aactgatggt tcccagaaga ctgagagttc aaccgaaggt aaagaacaag aagagaagtc 3 00 
cggtgaagat ggtgagaaag acacaaagga tgaccagaca gagcaggaac ctaatatgct 3 60 
tcttgaatct gaagatgagc tacttgtaga tgaagaagaa gcagcagcac tgctagaaag 42 0 
tggcagttca gtgggagacg agaccgatct tgctaattta ggtgatgtgg cttctgatgg 480 
gaaaaaggaa ccatcagata aagctgtgaa aaaagatgga agtgcttcag cagcagcaaa 540 
gaaaaagctt aaaaagcgtc gttttccagg gagtatggaa ggttttgtca ctctagatga 600 
ggttggtgat gaggaagatt cggaacttca gaaacttcgt aaatcgggca tggcatttaa 660 
atctggtgac aaaaatgatg atggtttggt tgaaattaag gtggacaaga tcgaggaact 726 
tgatcaagaa aacgaagcag cgttggaaaa tggaattaaa aatgaggaaa acacagaacc 780 
aggtgctgaa tcttctgaga acgctgatga tcccaacaaa gatacaagtg aaaacgcaga 840 
tggtcaaagt gatgagaaca aggacgacta tacaatccca gatgagtata gaattggacc 9 00 
atatcagccc aatgttcctg ttggtataga ctatgtgata cctaaaacag ggttttactg 960 
taagctgtgt tcactctttt atacaaatga agaagttgca aagaatactc attgcagcag 1020 
ccttcctcat tatcagaaat taaagaaatt tctgaataaa ttggcagaag aacgcagaca 1080 
gaagaaggaa acttaagatg tgcaaggaga tttaatgatt tcaaagaaaa tgatggttct 1140 
ttgtttttaa tgttaacctt ttttaaatac aatactgata gaagaaaact attgtactct 1200 
tttgttttag tggagaaata atagatgtct gttcatgtgt taagtgttat agcaaaaaaa 1260 
atacacatat ggttaagtta atgaatagtt tttgttttat cagaatggca. acagacagaa 1320 
gtactttgta gagattgact tcctaagcta cttaagacaa cttgcaccac taagaaaaaa 1380 
atgtagaacc atttggaaaa atgaaattta gtagttccaa gtttcaaaga aatgtcaaca 1440 
ttttattcca ttcaataaag aacaaaacca atagtgtttt tattactttc atctgaaaca 1500 
ttccatgttt taatctgagc cttgcagact ttcatttgga gtttgaaccc gttttggttg 1560 
catttcattt ttggagaact taattaacgt gagattggca attgaaatgc aggtgcagtt 1620 
ttctgttaat gtcatgctgt tgtttaggta ataagaaata ttaagtaatt ggctttagat 1680 
tttgtaattt ttttccctga gttcctgcta gatttcgtat tctagtagtc aatgtatttt 17*40 
cagtgaaatg caaaaatatt cccattatct ttgaccagta ttaatttttg agatcttact 1800 
gcttgtcact tgaatcccgt gattgtcata catctctggt ataagcaaca tttgattttt 1860 
gaagtgtgta gaccatctct tcatattttc aagatgtaat tttacatttc tgcattttta 1920 
aaacagtttg gccataatcc tagatgcacg cttctaattc atgtacctgc acatgtgacc 1980 
tttgtgaaca gaaatttgca tgtataattt gtgtttactt gtaactttct ggttatatac 2040 
tgcttatatc tgtggattca agttactgaa gtgaatacca ataaaaagaa aaccctaggc 2100 
catgttaatt ggttatacat gtttggaatg ttaaccaacg tatttgtcag ttgtggtttt 2160 
tattcgctct taaactttgt gcatgcttta acaatttatt acttttaaat ctagagtgaa 2220 
ttctaaagac tgccgctaaa gatctgagtt ttaaaaatgt tgttgctggt ggatttcttg 2280 
ttcctgttac ataactaaaa gtgaggccat ttgtggtttt taaaaacctt atgaattaaa 2340 
aatgctacag gtgaacaaac agaagcttat gtttagagag attgatgact taacagtaca 2400 
attggaagta atacggatga gcaagaatta gttctgcagc ttttcaaaat aattacgtag 2460 
agactcttgg tatattggat tatctgttgt aaac 2494 

<210> 116 
<211> 957 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 33 6338 . 8 : 2000MAY01 
<220> 

<221> unsure 

<222> 684, 708, 743 

<223> a, t, c, g, or other 

<400> 116 

gcatcttata atatgtggca gtttacaact gctttgacca gaaaactgat gtgatatcag 60 
tgttactgcc cagatatgag agttggatgc agagacatct gttcaaccga gttaggcaca 120 
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ttgfcgl 



tgactacttc ttcttgtgtt 
ctaaagaggt ttgaaaatta 
catggaaaca aacaaaacag 
aattaaaatt ctaattgata 
ctttgtgtta cttaaaatga 
tggtataaat gactgtatac 
tgtgttatta. attgatttct 
ggataccaat tacatttacc 
ggttgactga aatcacttgg 
ctcttccaat tccccaaatc 
atcatgctigc tctcgtcttc 
aagagtcctg ggcggaagga 
ggaaggttct tcaacaactt 
tagatttctt tgttcctgta 

<210> 117 

<211> 1269 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> mis cofeature 

<223> Incyte ID NO: LG:345527 . 1 :2000FEB18 



gaatttaatt 
aactattgtt 
aatgttacat 
agacaatatt 
tggtgcatct 
tctctgctct 
aatttattct 
atgggactgc 
ggtggaccaa 
tgcntctctg 
aantcccctt 
ggcttctgta 
tgctgcaaaa 
gacttataga 



cttcttcttg 
caatattcga 
aagtataaca 
gtgtcacagt 
taaatcttgt 
aggtatctca 
ttttctaatc 
aatgtctctc 
atttcctcct 
gccaatcaca 
ctctcttgct 
tgcttgtgaa 
catgtacatt 
ttattccata 



tgttgaattt 
aggaaagatt 
aatactttta 
ttaatatgaa 
gtcttagatt 
aagaagacaa 
tttttttttt 
ctctccttga 
gttgactact 
accttctncc 
gacttctgga 
aaagaacaat 
t.caaggctga 
ttattaaaaa 



PCT/US01/06059 

aattgccgtt 180 
cttgcataag 240 
ataagtctaa 300 
gcatcttttc 360 
caatgaaata 420 
atttcttgct 480 
tttcacaaat 540 
aaaacaatgc 600 
cctgaatgat 660 
atgtgtgtac 720 
aaagcaacgg 780 
ctgtgccaac 840 
gcagccatct 900 
gaaaaaa 957 



<400> 117 

cagggagtta 

aaacaattta 

cttgaaaagc 

tattgctaaa 

cttattgaga 

cgtttatgct 

aattgcaaac 

accaaaacga 

attggaaaaa 

agtgaatatc 

tcgtaaatgt 

atttaagatc 

gctgccagaa 

gttggacatt 

tccatgtctt 

tttatcttct 

acagttatca 

tgtcatagaa 

atcgatcaga 

cataaaaatc 

cctaaggaat 

agagaggtc 



aaacgattca 
tcagaacttt 
tttgagtctt 
gaaagagaac 
agactgaagt 
ccactttcaa 
atgtttggat 
cgaactagaa 
ctgatcagtc 
cctgtagaat 
tgtaatcatc 
gatgaagaat 
ctaaaaaaaa 
ttgatggatt 
actcagagag 
tagtgagtac 
tttatgatag 
ttggtcagac 
aaattgtgga 
atttcaaagg 
taatggatta 



atgctgataa 
ggtcattgct 
ggtttgacat 
agaatgtatt 
ctgatgttgc 
agaagcagga 
ccagtgagaa 
aatcaataaa 
aaatacagcc 
ctgaagttaa 
catatttgat 
tggtaacaaa 
gaggtcacaa 
actgccatct 
agaaaaaaac 
acgagctggt 
tgattggaac 
aaagccagtt 
aagagcagct 
tggtcagtct 
ttaaaatcta 



caaacttctt 
aaactttttg 
cactagtctt 
gcatatgctg 
tcttgaagtt 
gatcttttat 
agaaacaatt 
ttacagcaaa 
agaggtggac 
tctgaagctg 
tgaatatcct 
ttctgggaag 
ggtgctgctt 
cagagatttc 
atgcacagct 
ggcctgggca 
ccccagtcgg 
gttgtttatc 
gctaaaagga 
ggattaaatc 
gagattatga 



ttgactggta 
ttgccagatg 
tctgaaactg 
caccagattt 
cctcctaaac 
acagccattg 
gagttaagtc 
atagatgatt 
cgagaaagag 
cagaatataa 
atagaccctg 
ttcttgattt 
ttttcacaaa 
aacttcagca 
tcaacacgga 
ttaatctgac 
atcttcaggc 
gccttgttac 
aactggaaaa 
tgtctaagaa 
aagggaaata 



ctccc ttgca 
tatttgatga 
ctgaagatat 
taacaccttt 
gagaagtagt 
tgaaccgtac 
ctactggtcg 
tccctaatga 
ctgttgtgga 
tgatgctact 
ttacacaaga 
tggatcgaat 
tgacaagcat 
ggctgatggg 
tccagaggtg 
tgcagcagat 
ccaggataga 
agcaaatact 
gttgatcatc 
tttcttagat 
aaaggatcag 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1269 



<210> 118 
<211> 512 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 1089383 . 1 :2000FEB18 



<400> 118 

cctgtagcca 

ggttttgctg 

aaaaggaaga 

gctcacacaa 

agacccggga 

gaggagatcc 

atttcaaatc 

agaacctgac 

gtaccccaac 



ctgaggttca 
cttattgagt 
gatgtcttca 
agcctgtttg 
caggaggact 
acctacgacc 
ggattcccaa 
ccctcaaact 
tgccgtccgc 



caggctctgt 
gatgaagagc 
gctcaaatgc 
gtagtctctt 
ccttcaggag 
tcaggtcctc 
ctatatgaag 
ctacaacctc 
ctgcaggatc 



ttgaggtgtt 
aacatattag 
tatcaaactg 
cacatggacg 
acgggtcccc 
agaccaacca 
acaccctagc 
gatggactgg 
ct 



gtgttagatt 
ctactgagat 
gtgaaataaa 
cgtgtgacat 
tgtccttgcc 
gcccaaggaa 
tggacgatca 
accctactta 



gtgctgggtt 60 
attttgcaga 120 
cagcgttgtt 180 
ttggtgctga 240 
ctcactccgt 300 
catctcatga 3 60 
gttcttatta 420 
gtcatctata 480 
512 
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<210> 119 
<211> 530 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_featuire 

<223> Incyte ID No: LG: 1092522 . 1 : 2000FEB18 
<400> 119 

ggctaatgac ttattacttg ctgtttatat ttattttgga ctatggaaat gatgctagac 60 
aaaaagcaaa gttgagtgat tttcttattt aagttcaaag tgggtcgtaa aacagcagag 120 
acaacttgca atatcaacaa tgcatttggc ccaggaactg ctaatgaacg tacagtgcag 180 
tggtggttca agacactttt ca'aaggagat gagaaccttg aagatgagga gcatagtggc 240 
caaaccactg gaagttgaca atgaccaact gagagcaatc agcaaagctg atcctcttaa 3 00 
ggctacttga gatgttgccg aagaactcaa tgtggaccat tctacagtca tttggcattt '3 60 
gaagcaaatt ggaagagtga aaaagctcga taagtgggtg cctcatgagc tgagtgaaaa 420 
tttaaaaaat agtcattttg aagtgtcctt atcccttatt ttttgcaaca acaaaccatt 480 
tcttggtcag attgtgatgt gcgagaaaaa gtggatttta tacaacaact 53 0 

<210> 120 

<211> 671 

<212> DNA i 

<213> Homo sapiens 

<220> ' 
<221> misc_f eature 

<223> Incyte ID No: DG:1093216 . 1 :2000FEB18 
<220> 

<221> unsure 

<222> 436, 454, 499, 554 
<223> a, t, c, g, or other 

<400> 120 

aactgttttc ctttgaggaa tattgatgat cccactgctt gagtaatgga aaccagcttg 60 
cagctgatag ttattfctttg tcttttttcc tgttctatcc actgtcctag gtttggaatt 120 
tcagctgggc agcttcgatc ccaccacctg tagccactga ggttcacagg ctctgtttga 180 
ggtgttgtgt tagattgtgc tgggttggtt ttgctgctta ttgagtgatg aagagcaaca 240 
tattagctac tgagatattt tgcagaaaaa ggaagagatg tcttcagctc aaatgctatc 300 
aaacttgtga aataaacagc gttgttgctc acacaaagcc tgtttggtag tctcttcaca 3 60 
tggacgcgtg tgacatttgg tgctgaagac ccgggacagg aggactcctt caggagacgg 420 
gtcccctgtc cttgcnctca ctccgtgagg aganccacct acgacctcgg gtcctcagac 480 
caaccagccc aaggaacanc tcatgaattt caaatcggat tcccaactat atgaagacac 540 
cctagctgga cgancagttc ttattaagaa cctgacccct caaactctac aacctcgatg 600 
gactggaccc tacttagtca tctatagtac cccaactgcc gtccgcctgc aggatcctcc 660 
ccactgggtt c 671 

<210> 121 
<211> 257 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 27 03 18 . 3 : 2000FEB01 
<400> 121 

ttctttgaga ctatttaaaa actggtacaa caggtctcta caacgccaag atctaactaa 60 
gctttaaaag ggatagctgg gagtatggcc accctgctcc acgatgcggt aatgaatoca 120 
gcagaagtgg tgaagcagcg cttgcagatg tacaactcgc agcaccggtc agcaatcagc 180 
tgcatccgga cggtgtggag gaccgagggg ttgggggcct tctacggagc tacaccacgc 240 
agctgaccat gaacatc 257 

<210> 122 
<211> 1353 
<212> DNA 
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<213> Homo sapi€ 



<220> 

<221> misc_f eature 

<223> Incyte ID No: LI : 335671 . 2 :2000FEB01 
<400> ,122 ' 

gggaagtgaa tatccctgta gaatctgaag ttaatctgaa gctgcagaat ataatgatgc 60 
tacttcgtaa atgttgtaat catccatatt tgattgaata tcctatagac cctgttacac 120 
aagaatttaa gatcgatgaa gaattggtaa caaattctgg gaagttcttg attttggatc 180 
gaatgctgcc agaactaaaa aaaagaggtc acaaggtgct gcttttttca caaatgacaa 240 
gcatgttgga cattttgatg gattactgcc atctcagaga tttcaacttc agcaggcttg 300 
atgggtccat gtcttactca gagagagaaa aaaacatgca cagcttcaac acggatccag 360 
aggtgtttat ctcttagtga gtacacgagc tggtggcctg ggcattfaatc tgactgcagc 420 
agatacagtt atcatttatg atagtgattg gaacccccag tcggatcttc aggcccagga 480 
tagatgtcat agaattggtc agacaaagcc agttgttgtt tatcgccttg ttacagcaaa 540 
tactatcgat cagaaaattg tggaaagagc agctgctaaa aggaaactgg aaaagttgat 600 
catccataaa aatcatttca aaggtggtca gtctggatta aatctgtcta agaatttctt 660 
agatcctaag gaattaatgg aattattaaa atctagagat tatgaaaggg aaataaaagg 720 
atcaagagag aaggtcatta gtgataaaga tctagagttg ttgttagatc gaagtgatct 780 
tattgatcaa atgaatgctt caggaccaat taaagagaag atggggatat tcaagatatt 840 
agaaaattct gaagattcca gtcctgaatg ' tttgttttaa agtggagctc aagaatagct 900 
tttaaaagtt cttatttaca tctagtgatt tccctgtatt gggtttgaaa tactgattgt 960 
ccacttcacc ttttttatca tatcagttga catgtaacta gtaccatgcg tacttaaata 1020 
gatggtaatt ttctgagcct taccaagaac aaagaagtat ccatattaag tttagatttt 1080 
cagttaattt ttgagactga gtagtattct tggatacagg ctgatgtgta cttaaccact 1140 
tccagattta tacagtcttc qtgtggaagt ttagtaaatg tctttttccc tcctttcttc 1200 
tagtaatgca gttcatgggc tttaggtact tcagttatga agtaggcttt tcatggggag 1260 
agattgggat tatgctctct gttgtttaag aaactgtttg attttagagt ctatttctat 1320 
gagatagttt accaaataaa tgttccttat aag 1353 

<210> 123 
<211> 671 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI :793758 . 1 : 2000FEB01 
<220> 

<221> unsure 
<222> 554 

<223> a, t, c, g, or other 
<400> 123 

aactgttttc ctttgaggaa tattgatgat cccactgctt gagtaatgga aaccagcttg 60 
cagctgatag ttattttttg tcttttttcc tgttctatcc actgtcctag gtttggaatt 120 
tcagctgggc agcttcgatc ccaccacctg tagccactga ggttcacagg ctctgtttga 180 
ggtgttgtgt tagattgtgc tgggttggtt ttgctgctta ttgagtgatg aagagcaaca 240 
tattagctac tgagatattt tgcagaaaaa ggaagagatg tcttcagctc aaatgctatc 300 
aaacttgtga aataaacagc gttgttgctc acacaaagcc tgtttggtag tctcttcaca 360 
tggacgcgtg tgacatttgg tgctgaagac ccgggacagg aggactcctt caggagacgg 420 
gtcccctgtc cttgccctca ctccgtgagg agatccacct acgacctcgg gtcctcagac 480 
caaccagccc aaggaacatc tcatgaattt taaatcggat tcccaactat atgaagacac 540 
cctagctgga cgancagttc ttattaagaa cctgacccct caaactctac aacctcgatg 600 
gactggaccc tacttagtca tctatagtac cccaactgcc gtccgcctgc aggatcctcc 660 
ccactgggtt c 671 

<210> 124 
<211> 512 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc__f eature 

<223> Incyte ID No: LI : 803718 . 1 : 2000FEB01 



58/228 



BNSDOCID: <WO 0162927A2_I_> 



WO 01/62927 



PCT/US01/06059 



<220> 

<221> unsure 
<222> 290 
<223> a, t, c, 



g, or other 



<400> 124 

cctgtagcca 

ggttttgctg 

aaaaggaaga 

gctcacacaa 

agacccggga 

gaggagatcc 

atttcaaatc 

agaacctgac 

gtaccccaac 



ctgaggttca 
cttattgagt 
gatgtcttca 
agcctgtttg 
caggaggact 
acctacgacc 
ggattcccaa 
ccctcaaact 
tgccgtccgc 



caggctctgt 
gatgaagagc 
gctcaaatgc 
gtagtctctt 
ccttcaggag 
tcaggtcctc 
ctatatgaag 
ctacaacctc 
ctgcaggatc 



ttgaggtgtt 
aacatattag 
tatcaaactg 
cacatggacg 
acgggtcccc 
agaccaacca 
acaccctagc 
gatggactgg 
ct 



gtgttagatt 
ctactgagat 
gtgaaataaa 
cgtgtgacat 
tgtccttgcn 
gcccaaggaa 
tggacgatca 
accctactta 



gtgctgggtt 60 
attttgcaga 120 
cagcgttgtt 180 
ttggtgctga 240 
ctcactccgt 3 00 
catctcatga 3 60 
gttcttatta 420 
gtcatctata 480 
512 



<210> 125 
<211> 782 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> mis cofeature 

<223> Incyte ID No: hi : 412179 . 1 : 2000FEB01 



<400> 125 

aggcagcatg 

acccctaatt 

tctccatctg 

tgaagggcat 

caagatgagt 

cactgtgccc 

acatggtaaa 

gaattgttgg 

atacatcatt 

ttattttatg 

tttcttattc 

tgcatttggc 

agcgtagtgg 

at 



acaacatgat 
agcttcttga 
catgatggat 
cctgggacac 
tggtcactta 
tgcatctaat 
agaaaaataa 
aatttgctgt 
ttaatggaca 
tgtatttaaa 
gagttcaaaa 
ccaggactgc 
tgggccattg 



ggacagagca 
gctcagctaa 
tattttatag 
agaacttcca 
gaggacagat 
agctgctcta 
aatgaaacaa 
ataatactgg 
tttctcgctt 
ctatagaaat 
tgggtcgtaa 
taacgaatgt 
gaagttgaca 



cggattttct 
gttcttaaaa 
gttgaccact 
gtgctaaaac 
gaattgaaat 
taaatgtagt 
ataggtgagt 
aatacattct 
tatgtttgct 
gatgttagac 
aatagcagag 
acagtgcagt 
atgacaaact 



gagggataca 
cttctctgag 
cgtcctggtt 
ccagaaaatc 
gacatataga 
cacacaaaaa 
gcaaaataat 
taaataaatg 
aatgacttat 
ataaagcaaa 
acaactcgta 
ggcagttcaa 
aagagcaatc 



gccaggtccc 
cccttgattt 
tgcctgagac 
ccaggcatat 
aatggactca 
cgtgctcctc 
tgcagttttt 
tggttatgtt 
cacttgctgt 
ttcaagtgat 
acatcgacaa 
ggagacaagg 
attgaagctg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

782 



<210> 126 
<211> 1937 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 815679 . 1 : 2OO0FEB01 



<220> 

<221> unsure 
<222> 635 

<223> a, t, c, g, or other 



<400> 126 

ccaggtctac atgtataaaa ttttttaaca 
taagtcttca caatctaatg tgtatttgac 
agcccatatg gggggcccgc ccatgatgcc 
gccagtggga cctgctcttg gaatgaggcc 
tgggtgccca atgataagac ctcctgcccg 
gacttgacca gacagataag gatagagggg 
ttgttattgt tgtgtttttt ttgtttgtaa 
cctctgtcgc ctaggctgga gggcagtggc 
ccgggttcaa gcgattcccc tgcctcggcc 
caccatgccc cggctaattt tttttattta 
atggtctcga tctcctgacc tcgtgactcg 



gtaacaaaat actttgcttt ctttttatat 60 
acttctcgca catttcagaa tgatggcagc 120 
aaagatgggc cctcctcctc ctgggatgat 180 
gcccatggga ggccacatgc ctgtgatgcc 240 
tctcatgatg gtgcccagtc agcccagaat 300 
aggcctcatt gcatcagtgt tgttttgttg 360 
tgttttgttt tgtttttgag acagagtctt 420 
gtgatctcgg ctcactggaa cctccgcctc 480 
tcctgagtag tgtgggactg caggcgtatt 540 
gtggagacgg ggtttcaccg tgttggocag 600 
ctcgntcagc ctctgaaggt gctgggattg 660 



-59/228 



BNSDOCID: <WO_0162927A2_I_> 



WO 01/62927 



f 

sactgc 



PCI7US01/06059 



caggtgtgaa gccacTgcgc ctggcctata tgaattttat 
aggagatcat gctgctgtga tatcgagttt tcttaacagc 
tgccctatca aagagaatag ttttggaggg gagaagtggg 
catttgtatt gggaaatgtg aaaataaaat tgccaactgc 
aagaaaagga aacaagatgt ggggctgcca tatgcaatac 
tactctggag gcaaatatta . tcttcgctga agccagacca 
gtttttttaa- tgtgactgtg ttttatttta gaatgtgtaa 
tacctgtctg gggaagacta tttaatttcc tgcatttatt 
ttatgaaggg aaacagctct aacaactgag tgtcccccac 
cacggggcaa aggaattgaa cagcagcctc ctaatagccg 
ataattatca gcatgtaaaa gactatatat atattcaaca 
tcaaatctac aactgatttg cttcctgggc tcctgaaaac 
gaaatataat cagccaatcg ttgccccttg gggacgcagg 
ccaatgtgga gtcggccgtg cacaatgaca tgcagacttg 
atggtccctc cccagtcagg cccccattgc ctgggctgca 
agtgcattca acaaatactt atttaattgt attggtggtt 
ggtacattaa tggatccatg tcctccctgt atccaagact 
tcctcccact gaagaatcgg agtatatttc tccagtccct 
gtctagcttt ggccactgga atattaatct gtatgaccaa 
catttgtgct cgctcactcc tgctatcacc atgagaacaa 
tccagcagaa gataagagac accaagaaca aagtcgagct 
gactgaccaa tcctcag 

<210> 127 
<211> 504 
<212> DNA 
<213> Homo sapiens 



atttacctgc 
ataaggaaga 
accaaaaaag 
tttagttaaa 
cgtggattct 
acctgacaca 
ttcactttag 
tagaatgctg 
atagacacag 
gccttctttg 
attctgaccc 
aactttgtca 
ataaagcaag 
caggacatcg 
gccagcagca 
agagggttgt 
ctgccatttg 
aatgttgggt 
aaacttggaa 
gcctaggcca 
tcccagacat 



tcccttcacc 
cttgccctct 
atgcagtttt 
aacaaaaaaa 
acagatcttc 
aaggcctttt 
aagggcaaag 
gctgatgtta 
ctcatgagtt 
tgatgtggaa 
cctgcaaaat 
aaattgttca 
tcatccatga 
agtccctgct 
ttcaggcaca 
gattgattaa 
tctctgcagt 
tttgaattgt 
agtgtgaatjt 
ggctgetgct 
gctcatgcta 



720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1937 



<220> 

<221> misc_f eature 
<223> Incyte ID No: 



LI: 481361. 3 :2000FEB01 



<400> 127 

ggagccctgg 

agtgccccct 

agtccatggc 

atgtccaagc 

ttgccagcct 

ggaagcagtt 

ataaaatcag 

aaaatgaagg 

aagaaaagaa 



ggatatgatg 
gcaggagatg 
cctggctgac 
ttatgcaaag 
tgct'cgaggt 
tccagatttt 
agaaaagcag 
gagaagaaaa 
gaaaaaaatg 



ccagccaggg 
aagccccaga 
agagctgtgt 
catgaatgca 
tttgccctgc 
gtgcccgtgg 
aggcagaaac 
ttcataaaaa 
aatg 



aaggtgagaa 
gaaacacagc 
ttgaaaaggg 
acctgatttt 
tgaggatgcc 
acgttaatac 
tcctggagca 
ataaagcttg 



tgtggtatgt 
ggaccttctg 
catgaaagct 
cagattaaag 
caagatgcca 
cgacacgatt 
acaaagaaga 
gtcaaagcag 



gtgtgtgtgt 60 
ccaaaactca 120 
tttgtgtcat 180 
gatcttgatt 240 
gaattgagag 300 
ccatttaaag 360 
gagaaaacag 420 
aaggccaaaa 480 
504 



<210> 128 
<211> 971 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 
<223> Incyte ID No: 



<220> 

<221> unsure 
<222> 46 
<223> a, t, c, 



LG:247388.1:2000MAY19 



g, or other 



<400> 128 

aacaggaaac 

ttttggagaa 

tacttaacac 

ctgaagaaaa 

cacctgcagg 

aagcttaaag 

atcctagagg 

agaggcaacc 

gtcttcttac 



aattattgaa 
gaatttctcc 
aatgatggag 
aatcctaaca 
ccatggttcg 
ctggtggcca 
attatttaat 
ttcgacaggc 
tgtgtacccg 



gtagagctga 
ttcctttcca 
ttgcgcaagt 
gaattccgtg 
ttcagccggc 
taaagttctg 
ccagaggagg 
tgccattgac 
ggctggtgga 



ctaatatcca 
aaggggcagg 
gctgcaacca 
aagcttgcca 
aaactggttc 
atcttctctc 
tacttatatg 
cgcttcagca 
cttggtatta 



gaaganatac 
tcataccaac 
cccatatctc 
tattatacct 
ttattgacaa 
agatggtacg 
aacgtattga 
agcctgactc 
atcttacagc 



tatcgggcta 60 
atgcctaatc 120 
atcaatggtg 180 
catgaccttt 240 
gttgcttcca 300 
ctgcctagac 360 
tgggcgagtt 420 
agaccgcttt 480 
tgctgatacc 540 



60/228 



BNSOOCID: <WO 0162927A2J_> 



WO 01/62927 



tgcatcatct 
catcgaattg 
gagagagaga 
tccatgagtg 
gaagatcttt, 
aagttttgtg 
gaatctgaag 
acagatattt 



ttgattcaga 
ggcagagcaa 
tgtttgataa 
gtcgggatgg 
taagaaaagg 
aagaggacat 
gaaaaggttc 



ctggaatcca 
agctgtgaag 
ggccagcctc 
caacattact 
agcatatgca 
tgaccagatc 
cacctttgct 



caaaatgacc 
gtgtaccgcc 
aagttggggt 
ggaatccaac 
gccatcatgg 
ttgttaagac 
aaggcaagct 



tgcaggccca 
tcatcactcg 
tggataaggc 
agttctctaa 
aggaagatga 
gaactacaac 
ttgttgcttc 



PCT/US01/06059 

ggcacgatgt 600 
taattcctac 660 
tgtgcttcaa 720 
gaaggagatt 780 
tgaaggctcc 840 
catcaccatt 900 
tgaaaacagg 960 
971 



<210> 129 
<211> 192 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 
<223> Incyte ID No: 



LG:255789 . 10 : 2000MAY19 



<400> 129 ' 

aactaaaaga attgacaatt ttattttcac atttcccaat acaaatgaaa actgcatctt 60 

ttttggtccc acttctcccc tccaaaacta ttctctttga tagggcaagg gggcaagtct 120 

tccttatgct gttaagaaaa cccagcatca cagcccatga tctcctggtg aagggagcag 180 

gtaaatataa ag 1 192 

<210> 130 ' 
<211> 2002 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 787618 . 1 : 200 0MAY01 
<220> 

<221> unsure 
<222> 635 
<223> a, t, c, 



g, or other 



<400> 130 

ccaggtctac 

taagtcttca 

agcccatatg 

gccagtggga 

tgggtgccca 

gacttgacca 

ttgttattgt 

cctctgtcgc 

ccgggttcaa 

caccatgccc 

atggtctcga 

caggtgtgaa 

acccaggaga 

ctcttgccct 

ttttcatttg 

aaaaaagaaa 

cttctactct 

ccttttggtt 

gggcagaagt 

ctggctgatg 

cacagctcat 

ctttgtgatg 

gaccccctgc 

ttgtcaaaat 

aagcaagtca 

gacatacgag 

tagaagcatt 

agggctgttg 



atgtataaaa 
caatctaatg 
ggtggcccgc 
cctgctcttg 
atgataagac 
gacagataag 
tgtgtttttt 
ctaggctgga 
gcgattcccc 
cggctaattt 
tctcctgacc 
gccactgcgg 
tcatgctgct 
atcaaagaga 
tattgggaaa 
aggaaacaag 
ggaggcaaat 
ttttaaatgt 
acctggtctg 
gttatctatg 
gagttcaagg 
tggaaataat 
aaaattcaaa 
tcgttcagaa 
gccatgacca 
tccctgctat 
caggcacaag 
ttgattaagg 



ttttttaaca 
tgtatttgac 
ccatgatgcc 
gaatgaggcc 
ctcctgcccg 
gatagagggg 
ttgtttgtaa 
gggcagtggc 
tgcctcggcc 
tttttattta 
tcgtgactcg 
cctggcctat 
gtgatgtcga 
atagttttgg 
tgtgaaaata 
atgtggggct 
attatcttcg 
gactgtgttt 
ggggaagact 
aagggaaaca 
ggcaaaggaa 
tatcagcatg 
tctacaactg 
atataatcag 
atgaggagtc 
ggtccctccc 
tgcattcaac 
tacattaatg 



gtaacaaaat 
acttctcgca 
aaagatgggc 
gcccatggga 
tctcatgatg 
aggcctcatt 
tgttttgttt 
gt gate t egg 
tcctgagtag 
gtggagacgg 
ctcgntcagc 
aatgaattta 
gttttcttaa 
aggggagaag 
aaattgacaa 
gecatatgea 
ctgaagccac 
tattttagaa 
atgttaattt 
gctctaacaa 
ttgaacagca 
taaaagtcta 
atttgettec 
ccaatcgttg 
ggccgtgcac 
cagtcaggcc 
aaataettat 
gat-ccatttc 



actttgettt 
catttcagaa 
cctcctcctc 
ggccacatgc 
gtgcccagtc 
gcatcagtgt 
tgtttttgag 
ctcactggaa 
tgtgggactg 
ggtttcaccg 
ctctgaaggt 
taatttaacc 
cagcataagg 
tgggaccaaa 
ctcttttagt 
ataccgtgga 
gaccaacctg 
tgtgtaattc 
cctgcatgtt 
ctgagtgtcc 
gcctcctaat 
tatatatatt 
tggggctcct 
ccccttggga 
aattacatgc 
occactgcct 
ttaattgeat 
ctccctgtat 



ctttttatat 
tgatggcagc 
ctgggatgat 
ctgtgatgcc 
ageccagaat 
tgttttgttg 
acagagtctt 
cctccgcctc 
caggegtatt 
tgttggccag 
gctgggattg 
tgctcccttc 
aagacttgee 
aaagatgcag 
taaaaacaaa 
ttctacagat 
acacacaagg 
actttaggaa 
atttagaatg 
cccacataga 
agccggcctt 
caacaattct 
gaaaacaact 
cegcaggata 
agacctgeag 
gggctgeage 
tggtggctag 
ccaagactct 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

12-60 

1320 

1380 

1440 

15O0 

1560 

1620 

1680 



61/228 



BNSDOCIO: <WO 0162927A2_I_> 



PCT/US01/06059 

tccagcccct 1740 
acatgaccaa 1800 
aaaagaacaa 1860 
aagtcaagct 1920 
tccatgaaaa 1980 
■ 2002 

<210> 131 

<211> 8277 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LI : 33 1610'. 2 :2000MAY01 
<220> 

<221> unsure 

<222> 4213, 4217, 4222 

<223> a, t, c, g, or other 

<400> 131 

agcaatgcaa gattagatct ctaaatgcag caaaacactg cctgaaaaca gaccggcccg 60 
cgcagcaagc agacatttca cggtgcgctg gggaagcttc aaaatatatc tgtgactctg 120 
tcttcgttgc tcttcatccc catcaatttc atcacgggag gcgagcagca agtaagaatt 180 
tcactttcgg atctgcctag agacacacct ccctgctccc tcccccactc gatgtgaaga 240 
gtattccgga gtctccgggc gggagtagat ttgcagcacc cctaagcggg agcgagggaa 300 
aacctactga ttctttagct cattatcatc tctcccagac gagatttcct tcttatcgcc 3 60 
tgcctcatcg ctcaagtttg agcctcccga agtccgggcg ggagagacga aacccctggc 420 
tcacccccag ccgcaggaag ccaccgcctt gctccaagcc cctgcagctc tgctgcaccg 480 
cagcttctca cccagtgcgg atgctgtaga tcaacaggtt cagggaactt ' gagcagaata 540 
aggagagacc accgggtgcc gcagctcggg* tgcagaggga aaaaaggacc catagacttg 600 
tggctcgcgt cgcgcgcgca cgctgcgcca ggggcccagg ctggggcgca ctccctctct 660 
gggtcctcca gtccgattgc cctgccccca ccttacaggt ctgggatgta cctttccatc 720 
tgttgctgct ttcttctatg ggcccctgcc ctcactctca agaacctcaa ctactccgtg 780 
ccggaggagc aaggggccgg cacggtgatc gggaacatcg gcagggatgc tcgactgcag 840 
cctgggcttc cgcctgcaga gcgcggcggc ggagggcgca gcaagtcggg tagctaccgg 900 
gtgctggaga actccgcacc gcacctgctg gacgtggacg cagacagcgg gctcctttac 960 
accaagcagc gcatcgaccg cgagtccctg tgccgccaca atgccaagtg ccagctgtcc 1020 
ctcgaggtgt tcgccaacga caaggagatc tgcatgatca aggtagagat ccaggacatc 1080 
aacgacaacg cgccctcctt ctcctcggac cagatcgaaa tggacatctc ggagaacgct 1140 
gctccgggca cccgcttccc cctcaccagc gcacatgacc ccgacgccgg cgagaatggg 1200 
ctccgcacct acctgctcac gcgcgacgat cacggcctct ttggactgga cgttaagtca 1260 
cgcggcgacg gcaccagagt tcccagaact ggtcatccag aaggctctgg accgcgagca 1320 
acagaatcac catacgctcg tgctgactgc cctggacggt ggcgagcctc cacgttccgc 1380 
caccgtacag atcaacgtga aggtgattga ctccaacgac aacagcccgg tcttcgaggc 1440 
gccatcctac ttggtggaac tgcccgagaa cactccgctg ggtacagtgg tcatcgatct 1500 
gaacgccacc gacgccgatg aaggtcccaa tggtgaagtg ctctactctt tcagcagcta 1560 
cgtgcctgac cgcgtgcggg agctcttctc catcgacccc aagaccggcc taatccgtgt 1620 
gaagggcaat ctggactatg aggaaaacgg gatgctggag attgacgtgc aggcccgaga 1680 
cttggggcct aaccttatcc cagcccactg caaagtcacg gtcaagctca tcgaccgcaa 1740 
cgacaatgcg ccgtccatcg gtttcgtctc cgtgcgccag ggggcgctga gcgaggccgc 1800 
ccctcccggc accgtcatcg ccctggtgcg ggtcactgac cgggactctg gcaagaacgg 1860 
acagctgcag tgtcgggtcc taggcggagg agggacgggc ggcggcgggg gcctgggcgg 1920 
gcccgggggt tccgtcccct tcaagcttga ggagaactac gacaacttct acacggtggt 1980 
gactgaccgc ccgatggacc gcgagacaca agacgagtac aacgtgacca tcgtggcgcg 2040 
ggacgggggc tctcctcccc tcaactccac caagtcgttc gcgatcaaga ttctagacga 2100 
gaacgacaac ccgcctcggt tcaccaaagg gctctacgtg cttcaggtgc acgagaacaa 2160 
catcccggga gagtacctgg gctctgtgct cgcccaggat cccgacctgg gccaagaacg 2220 
gcaccgtatc ctactctatc ctgccctcgc acatcggcga cgtgtctatc tacacctatg 2280 
tgtctgtgaa tcccacgaac ggggccatct acgccctgcg ctcctttaac ttcgagcaga 2340 
ccaaggcttt tgagttcaag gtgcttgcta aggactcggg ggcgcccgcg cacttggaga 2400 
gcaacgccac ggtgagggtg acagtgctag acgtgaatga caacgcgcca gtgatcgtgc 2460 
tccccacgct gcagaacgac accgcggagc tgcaggtgcc gcgcaacgct ggcctgggct 2520 
atctggtgag ccactgtgcg cgccctagac agcgacttcg gcgtagagcg ggcgtctcac 2580 
ctacgaagat cgtggacggc aacgacgacc acctgtttga gatcgacccg tccagcggcg 2640 
agatccgcac gctgcaccct ttctgggagg acgtgacgcc cgtggtggag ctggtggtga 2700 
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gccatttgtc tctgcagttc ctccccactg aagaatcggg agtatatttc 
aatgttgcat ttagtcacat gtctagcttt ggccactgga atattaatct 
aaacttggaa agtgtgcatt catttgtgct cactcactcc tgctatcacc 
gcccaggcca gactgctgct tccagcagaa gataagagac accaagagca 
tcccagacac gctcatgcca gattaaccaa ,tcctcagcta gacccatagc 
taaatgactg ttgtattaag cc 
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aggtgaccga ccacggcaag cctaccctgt 
tgagcggatc ccttcccgag ggggtaccac 
tgtcgctgcc gctcatcgtg actctgagca 
tcaccatcgc cgtcaagtgc aagcgcgaga 
tcgccgagta cagccacccg cagctgggtg 
aaaatgatat' catgctggtg cagagcgaag 
acgtggtgag cagcccctcc ctggccacct 
tgcccctcag ctcgccccgg tcggaggtga 
ctgtccctca ggggcacgcg ggctgccaca 
gcgagacccc tgccactcgg atgtccataa 
attacatggg cagcaggcag cagtttgttc 
cccagaaaga gccagcctga gagacagtgg 
ccaagacact aacaaaggct cctgctgtga 
aactacttca actaaaagcc aaccacttga 
agatgaatgc cgagtgcttg gtcattctga 
caatcaggct gaaaatgcag attaccgcac 
tgttgagact gagacttacg aaactgtgaa 
tggaaaagac aagcgagagc acactattct 
caaacgtgcc ctgagccctc tcctccaaga 
ggcgtgcatc gagccttgca cctcaacaaa 
aacccaggag cccctggctg aagcaagcag 
tgtcacctag taagcaacca agagaccctc 
tctttgcaga tgtgca'ttcc agagccagcc 
agcagcttga ccaccccaac agggatctgg 
tgagagaaat tgataagctt ttgcaagact 
agtgaaaaaa ganaaanaaa anggcattgg 
atgatccctc ctggtgataa cacattttac 
ggcttttagt gaacatctga agtgcccaca 
tcagagataa caatggtttc gttttgacca 
taaagagaaa agaaaaaaag agagaagaaa 
agaattacct tttgacaatc tgttaggaag 
atcactctca gactgtcctc cgtgatttat 
atacaaagcc aggggcataa tttgtgatct 
tctactgaaa gtcctgaaaa gaccttgcag 
atttgtgggt aaacatttct gtataaagtt 
tccatatcca tagtccgaaa accaaaacca 
atctccatgt catattaaaa gttaaatgta 
acgtagacta attcacgtca ttaaagaaga 
agcattttag tgtccaacaa agatttaaac 
aaaagttcta aaaaatagtt accattgagt 
atatgtttta ttcatttgtg gactactaat 
acatacgcta gtcacatgac catttaaatg 
atctattgac atattttatt gatttattgc 
gtgtgtaaga gctatattgt gtattttatt 
tttgggctta tattgtaaat ggacaagtgt 
agatttatgt tttttcctgc tttccttttc 
gagcaaagtc tcttgtttaa tgtatagtct 
aatgtgtttg agtgagctga tggagctaac 
tggttatgtg caagtccttg tagaagcttt 
acatttgtac caatatctga aacttcttca 
gtgtagatat tatgccatca gttggttctc 
tcccattata ttttgaaaac atgaaaaatg 
acttgcattg tgtagtgttt ttcaagaggt 
tcagtgctct tctgcctctt tttgttgggg 
tcatttggtt acacctttcc tttgtgacat 
ggctaagaga cctcagattt caggatttgg 
cagcagatgg gaaaatgcct tgattgacat 
ttacagcttc atgacaaaca gttttgtgcc 
ataaaggctg tttgagcact ggagcagaaa 
ttgtgccttc tcttctggcc accaagccag 
cttatattta aaacaaaaac aaaagcaaaa 
ttttaaactt taaaaacttc tactgaaaat 
ctgtacctcg tttgcgaaac tgctttgaga 
ttctcttgag aactatattt tggttcctag 
ttttattact cataaatgtt taaattagaa 
ttgactctct atattttgga caatttatgt 
tgttattttt gccaggagac tacaggttga 



PCT/US01/06059 

ccgcagtggc caagctcatc atccgctcgg 2760 
gggtgaatgg cgagcagcac cactgggaca 2820 
ctatctccat catcctccta gcggccatga 2880 
acaaggagat ccgcacttac aactgccgca 2940 
ggggcaaggg caagaagaag aagatcaaca 3 000 
tggaggagag gaacgccatg aacgtcatga 3 060 
cccccatgta cttcgactac cagacccgcc 3120 
tgtatctcaa accggcctcc aacaacctga 3180 
ccagcttcac cggacaaggg actaatgcaa 3240 
ttcagacaga caattttccc gcagagccca 3300 
aaagtatttc agtagctcca cgtttaagga 33 60 
gcacggggac agtgatcagg ctgacagtga 3420 
catgtctgtt agggaggcac tcaagatgaa 3480 
acaagaacca gaagagtgtg ttaattgcac 3540 
caggtgctgg atgccacagt tccctgcagc 3 600 
aaatctcttt gtacctacag ttgaagctaa 3660 
tcccactggg aaaaagactt tttgtacatt 3720 
cattgccaac gttaaacctt atttaaaagb 3780 
ggtcccctca gcatcaagca gcccaaccaa 3840 
aggcctcccc tggatggcct gtgaaggcaa 3900 
tcagtacttg ccccactgac agtcaatatc 3960 
ccttcatggc ttccgatcag atggcaaggg 4020 
gggattccag tgagatgggt gctgttcttg 4080 
gcagagagtc tgtggatgca gaggaagttg 4140 
gccggggaaa cgaccctgtg gctgtgagaa 4200 
cattttcttg tctcttctgt tgatttaaaa 4260 _ 
agggatgaag aaagaccaat gctgctttaa 4320 '] 
agtatgttct ttccactgct gatttctttt 4380 
aacttgtatt aggacagaat taatgatgct 4440 
aaggagagat gaaaaaggag gatgaggaga 4500 
gtatgcagtg tgagaactga agtatttctg 4560 
gctgacttaa ctgtttacct ataaaccccc 462 0 
gtggtggatt tctagcagtc atcacaggct 4680 
tagtccaagc tacaccaaac attaacacat 4740 
acctgacaca catataaaca caaggaacat 4800 
aaaaaaaacc tttggtccat ttgtaagacc 4860 
aaaagataca gtccattttg tcctgcacac 4920 
agaaaactta aagatttaaa atgcctattt 498.0 * 
aatgatgaat atgttttaaa tttgacatag 5040 ' 
ggtaagattc agagaaaatt aacttgatta 5100" 
tatagctcta ggaaagtgaa aatgtcttag 5160 
tgctaatgta agaagattca atgtgtttac 5220 
agattcagtg catatgagcc aaattgttga 5280 
aaattaatat atagttgtgt tgcaaaaata 5340 
tgccttggta gcctgtccga acctcctatg 5400 
cccatgggag tgtggggaag cagtgcctca 5460 
accaagtact acagtacata atctgttcaa 5520 
tgaaaggtca aaaattacat ccatcagtca 5580 
tattaaagtc atgctaaatc acaagaattg 5640 
tgttttttca ataacataca gcttctgcct 5700 
aaaagtattt taagtgcttc agatgtgtgt 5760 
ctttaatgca tgtatgtacc agcagtggtt 5820 
ctgggtctta acaaaatgtt ttcctttatc 5880 
tcctttgaga acaatacacc ttctattcct 5940 
ttagcgagtt tcaaacttac ttccatatga 6000 
gaaaaataaa attagcactt gcagaagtag 6060 
tttctttcag catttaaaat ttttggcatt 6120 
cataccttag aaaatgtggt gctgagttaa 6180 
aatgcattat ttgcaaactg gtggataatt 6240 
tgtagaaaca gcaaaaatgt cataaaaatt 6300 
acaaacattg aattacatta agttttgtca 6360 
atttccgcca aatgccatca atattttaga 6420 
gggaagagtg gacaactccc atcagcctta 6480 
taacagcctt tccaaagctc tactcttggt 6540 
aagaagggac cttgtacatg tgaaacctaa 6600 
atctgaaatg tgttgtgctc tgttatatga 6660 
tttagcttga tagctgaaat ttgatggaaa 6720 
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actgatttcc atttagtctt accaagtgtt gcttctctct tactagacag atatccactt 6780 
agtaaaatct aaagcagtat gtaaatgaaa ccagcaaaga gagtagggtt tattttaata 6840 
aacattctta attgctaagt aaaccagttg ttcaatttat tatatgtgtc tgaggacatt 6900 
aaaacaccat taagattgta ataattggtt gtgcccaatg tgtgagggat ttacctttta 6960 
ggctctctgt caccagtgat ttactagtgt tagctgttta acacattatc tgtatttagt 7020 
agtgattatt tatttacaag ttggtggtaa 'ttcagcaggt caggagtcta agcttttata. 7080 
gttgaattga ggaaatctcg cttttattca tttagctggc aactgccttt attgcagacc 7140 
tctggtgctt ggctttcaag gaagcctatg agatgccaaa atcacacctt tagagagcac 7200 
cttgctctaa tag^gtgatg catgagcaaa cagtgagatt tgaaggggtt ttaacataac 7260 
tttagaatgt gaaaacaata tcaattcata tctttcaagt actaacccct caaacaagcc 7320 
cacacataca aaactatgtg atgtgatacc actttgtctt ttaggtcttt aagtaactga 7380 
agttaagcac agaaaaaaaa atcacttcat ggaaatttca gtaagaaacc caaactttct 7440 
agaaaattgg cttggcagat gagctaagca aaaaaaaaaa aaaaaagcaa caaaataacc 7500 
ttttcatcag agttaaaagt agtgagaatc tggttagtta tacgtaatac caaaggtaaa 7560 
cattaaagaa gacattatca ggcaatttca aagaattctt tcatgctatt tcttaaccct 7620 
gacatttcta actttattgc aggcaatata caaagattgg ctcactactc cataggttaa 7680 
ttgaattcct ggttgagaaa ctaacttgtt ttgttttcca aaattagctg aaatcttgta 7740 
aaacatgact tccctttaaa ggatctagat attgttcaat ttaaaatatg gcaccataaa 7800 
aaagtcatgt agtaatagag catatgcttt tttagaacca ggttaaaagc tgtttgttat 7860 
ctaatagagt aaaagttact gaggtcaaca gatagacagg tctggcataa tatatgcccc 7920 
agtcataata gtctaattat attcttttga caacagacag cattcatcag agcactaatt 7980 
cgaatgagag ttttggctat caagatgtgt tgattctaaa tattattgaa tgagacctga 8040 
ttaaagtctg acttttcctg gccccagtgt tttgcaggtt ggctatttcc attatcagcc 8100 
catcactcca taaagttctt agctgccacc aagtgattgg tgaacaagga caacaacaaa 8160 
agcagcatac attgtatgga tttatctcaa cggctattgt ttaatggctg tgtttaaatg 8220 
tgatctatct ggaaaatgag gactccggaa taaaaagcta ttactaataa aaaaaaa 8277 

<210> 132 
<211> 1417 
<212> DNA 

<<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 982697 . 1 :2000FEB18 
<220> 

<221> unsure 

<222> 788, 802, 1173, 1187, 1207 
<223> a, t, c, g, or other 

<400> 132 

gtggtgtggc tgcttctagc 
cttatattta aaagggaagc 
ggcagagaaa gaaaaaagct 
ttgctagaga tgtttgcagg 
aaaaagcctt gaaagcattt 
agcttaggag gagagaatag 
cttgggacac tgctcaccac 
ggggcctaca tacagcttgg 
tttcatgtgg tgttaagcct 
ctctgcctag atttcagagg 
caggggtgga gcccttacaa 
tggaggctcc aacagggtca 
actgccctcc agacctaagg 
agccacangc actcaactgc 
cataggggca ggactgctca 
tgtggaacat gatgtcgaag 
gttttggagc tgtgtggatc 
atggaaacat ttacccaatc 
ttgattttac aggctcataa 
tggacttttg aattaatgct 
actgtanttt gcagtatgag 
tgatttggat ctgcatcccc 
gttgagcctg gtggaagagg 
ccctgggtgc tgttctcatg 



cacttatatt cagatgtggg 
agagcaaaaa aagtttggaa 
ttttcagtag aggaattcaa 
attaaagggg agccaagtgc 
caaagaactt cagagtggct 
attcatggac tgggcccagg 
atcctagctg ctcctgttcc 
actgccactt tggagaatgc 
gcaggcacac aaagtgcaag 
acatatggaa aagtctgttt 
agaacctcta ctagggcagt 
ccgctggggc actgcctaat 
atgatagatc taccagcagc 
ancccatgag cgcagccttg 
aggcctgggg agcccacccc 
attattttgg aactttaaga 
ctgtagaccc tttcttttgg 
cctatacccc cattgtatct 
gtggaaggga cttgccttgt 
ggaatgagtt aanactttga 
aaggacatga gatttgggag 
accaaaatct catgttcaat 
ttggataatg ggggtggttt 
acagtgagtg agttatt 



agcaaagaca aaaggtagaa 60 
aatttgcagc ctggccatgt 120 
gcagactgtg gaacaattac 180 
tactatccaa gacaataggg 240 
tctcccatca caggcccaga 300 
gccatggtac cttgcacagc 360 
cactcttgcc atggctcaaa 420 
aaacagtaaa ccttggtagc 480 
aatcgtggat tcttggcagc 540 
gtccaggcag aagcctgtta 600 
atagagggga aatgtggggt 660 
ggaactatac gaagtgggcc 720 
ttgtattctg cacctggaaa 780 
gattctttac cctgcaaagc 840 
ttgtaccagt gtggcctgga 900 
tttagtgact tccctactgg 960 
ccaatttccc cctttttggg 1020 
tggaagtaaa taatttgttt 1080 
ctcatatgag actttggact 1140 
gggactnttg ggaagatata 1200 
acaccagagg tggaataata 1260 
tgtaatccta aattttggag 1320 
ctcatggttt aacaccatcc 1380 

1417 
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<210> 133 
<211> 1143 
<212> DNA 

<213> Homo sapiens 
<220> 1 

<221> misc_f eature 
<223> Incyte ID No: 



LG: 1080896. 1:2000FEB18 



<400> 133 

gccgacagca 

caggctcctt 

gaagtgtgga 

cgtcctgatc 

ccgtgtaaaa 

cagagtgacc 

ccacgcacag 

cttacagtaa 

caagtcactg 

tgtaccagga 

ccaggaaata 

ttttaaaata 

aacagaagaa 

tgatagtaag 

cagaaaataa 

gtgtcatttg 

aaagtcagcc 

ccctcagacc 

gatcttagtg 

aaa 



taatgtgagg 
tgaagatctt 
tcaaggactg 
actcccactg 
cctgcagcgc 
ctgaagaaga 
ccgaagcatg 
gggcttcaac 
aggtaggaca 
ctctaaaaga 
accaacttga 
caattaagga 
aaaggggtcc 
cagcatggaa 
gtacatctgt 
agctacttaa 
cctcctgcac 
caagtaaaat 
taagcatcct 



caagtctagc 
caggtggact 
gaacgtagcc 
ctgtgaacgt 
ctgaatcctg 
tgacaagccC' 
aggaagctca 
tgaccttcct 
aaagattgct 
aacttgtttg 
tgcgtgttat 
ctggtccttt 
ccaaaaatgt 
taggatgcgg 
caaaaatcat 
aaagaaaata 
aagtgggagc 
aaaaaaaaat 
aataaaagga 



tgttcccccc 
tcacagaaat 
tcttgtgccc 
agagagaatc 
ggaggcaaga 
tgctccagtc 
ttgtgggctt 
cagactgaga 
acattcctat 
, tataatgcta 
gacccatttt 
tctaggtgac 
aaccttaaaa 
ttctctaaat 
atttatgcga 
aaaaaagatc 
tgcatccttt 
gtaacattag 
gaggttcaaa 



cggcatacaa 
tgccaagttg 
atggtgaagg 
ctagcctgga 
ccaagtctgg 
acacccagaa 
catttttctt 
actgtttcca 
tattttatgg 
ttctatccaa 
aagcctccca 
acaagaaagg 
tttgacgctt 
tggaaaaaaa 
gatgtgtcaa 
ctgtttggct 
taaaattgat 
gcattgatgg 
aacgctctcc 



gcttatagag 
tggaggtgtc 
acccagatgt 
tccatcacaa 
acaacccctg 
gctgactggt 
aaattttgga 
gtatatacat 
ttattataag 
ggtatgtagc 
tgatcacagt 
aaa t age tag 
gtgecactat 
agttacacag 
tactggtctt 
ccaaaaatga 
aatcacaaac 
aaaagggctg 
agaaccagta 



60 

120 

180 

240 

3.00 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1143 



<210> 134 
<211> 899 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_f eature 
<223> Incyte ID No: 



LI: 811341. 1:2000FEB01 



<220> 

<221> unsure 
<222> 500 
<223> a, t, c, 



g, or other 



<400> 134 

acctgacctg 

agttttaacc 

agacaggatt 

agcacaagaa 

gtggtaactt 

accaaagctt 

caccaccttg 

tcagtttatt 

gggtacgtgg 

gctctgctcc 

etcattgegg 

cttcctcaga 

ttgctacagt 

gtttgtataa 

gttatgaccc 



acaaaactct 
catactgtga 
tccaaaggct 
gcagataaac 
taatgactac 
gctatgtgaa 
ctcctggtat 
ctagcaggcc 
aegggagtan 
agtcacaccc 
aactcatttt 
ctgaggactg 
cctattattt 
tgttattctg 
atetgagect 



ttacactcta 
ggccctggcc 
ggcccccagg 
taacccttag 
caaaggacct 
aatccccaca 
cagagagect 
caacctccga 
ttcgccaacc 
agaagctgac 
ccttaaaatt 
ttccagtgta 
tatggttatt 
tacaaggtat 
cccatgacca 



tgtgtcagaa 
aaggccagtg 
tetaagggee 
gaaaacctga 
cattggttac 
taaccactga 
agttgaacat 
gactatcctt 
ectgeaaagt 
tggtccacgc 
tggactttta 
tacatcaagt 
ataagtgtac 
gtageccagg 
cggtttttaa 



agagaaaaaa 
gcctatctct 
c tagcagcaa 
atataaaggc 
aaatgetaga 
agtttgcaac 
actgtgtaga 
aaacatcagt 
gactctgaag 
aeggecgaag 
cagtaaggac 
cactgaggta 
tggaactcta 
aagtgaccaa 
aataagatta 



tggcagttgg 60 
c aaaac aac t 120 
cagccctgtt 180 
cccccatgct 240 
ttaaccaagt 3 00 
accctaaccc 360 
ggtgttggac 420 
agactgtgag 480 
aagacgacaa 540 
catgagaaaa 600 
ttcaactgac €60 
ggacaaaagg 720 
aaaataactt 780 
cctgatgtgt 840 
aggactgag 899 



<210> 135 
<211> 923 
<212> DNA 
<213> Homo sapiens 
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<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 903225 . 1 : 2000FEB01 
<220> 

<221> unsure 
<222> 239 , 

<223> a, t, c, g,, or other 
<400> 135 

ggaggaaggt aatgaaggag attatcttat cctacttatt attttacttc cttattctgt 60 
gggacagaag aatctcttga gtcctctaaa aaactatatg ctgatagcag gagttacaga 120 
acaattcagt agtgtctgga atagaacagt ggtttgaaaa gtagagggct tgttaaagca 180 
cgggatgctg attcagtagg ccttgagtgg ggataattta catatccaac aggccaggng 240 
atgccctggc ttgggigatca cattttgtga ctgatgcaga gtactgatgg tcatceagtc 300 
cccttccttc ggtgtttgag accacttcat ctggaccgag ctaaagtcta ggaagaaata 360 
aagtttcaaa cccagtagag ttacctcaaa gatacacttg agaccctttt cagaagatgg .420 
caccgaaagt gaagaaggaa gctcctggcc cgcctaaagc tgaagccaaa gcaaaggctt 480 
taaaggccaa gaaggtagtg ttgaaaggtg tccacggcca caaaaaaaag aagatccgca 540 
tgtcacccac cttccagcgg cccaagacac tgagactctg gaggccgccc agatatcctc 600 
ggaagaccac ccccaggaga aacaagcttg accactatgc tatcatcaag tttcctctga 660 
ccactgagtt tgctatgaag aagataaaag acaacaacac ccttgtgttc actgtggatg 720 
ttaaagccaa caagcaccag atcaaacagg ctgtgaagaa gctctgtgac attgatgggg 780 
ccaaggtcaa caccctgatg gagagatgaa ggcatatgtt ccactggctc ctgattatga 840. 
tgctttggat gttgccaaca aaattgggat catccaaact gagtccatct ggctaattct 900 
aaatatacat atctcttttc acc 923 

1 i 

<210> 136 
<211> 343 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 242079 . 2 :2000FEB01 
<400> 136 

gctcagggtc ctgc'tgccct gcacaacctc aggaaactgc tccctgaatc ccagctacta 60 
cagctccagc tgtggctcaa aggaccccag gtatagctca ggctgcctgt ccagaggatg 120 
caagccataa gccttagtgg ctcccgtgtg gtgttaaatt aagcctgtgg gcacatagag 180 
tgcaagaatt gaggcttggg agcctccaac tagatttcag agtatgtgtg ggaaagcctg 240 
gatgtccagg cagaagccag ctgcagggac agagccctca tggagaacct ctactagggt 3 00 
agtgtggagg ggaaatttgg ggttggagtt cccacacagc ttc 343 

<210> 137 

<211> 1227 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LG: 979580 . 1 : 2000MAY19 
<400> 137 

cttggatatc tgctcatttg gcccgatgag aagagaaaaa cagaaaacaa aagctctgag 60 
tgggtagagc gtggggggag cttgaagctg ttattaaagg aaggtatgat taactcctaa 120 
ctcacagccg cttcagctca tctccatggc tcagccacca ggccagcctt tgctgtcacc 180 
taacagataa ttattgtctt ttctacaata tgattaagtg gaaggagaat cagaacactg 240 
tagaactgga tctgatattt cttgttagca tatatatata tatttaatct atgtgcttgt 300 
ccttgaaaac agcacttcct gtacccagca atcacagagc tgctcttgga atttaacacc 360 
ctggtggttc tcaaatgcca tctgaaaaac attattagtg ttacccatca tgacctctgg 420 
gtagtggaag gtgaatggga gaaatctctc cccatttgaa gaaataggga accagtctca 480 
ctttgttgcc caggctggag tgcagtggca caatcttgct cactgcaacc accacctccc 540 
gggttcaagc gatcctccca cctcaacctc ccaggtagct gggtctgcag gcgtgcgcca 600 
ccacacccgg ctaatttttg tgtttttagt acagaaagag tttcaccatg ttgaccaggc 660 
tggtctcaaa ctcctaacct caagtgattg gcccacctgg gcctcccaaa gtgctgggat 720 
tacaggtgtg agccactgca gcccagccta cgaagttgtt tttgcggtta agcagcaatt 780 



66/228 

f 



BNSDOCID: <WO 0162927A2J_> 



, '/ 



WO 01/62927 



PCT/US01/06059 



tggaaacgaa 

cctatgaact 

ccttctctat 

gaagtcagga 

gcataaaatg 

tatagctttt ' 

tctcacgtac 

atcatttttt 



gcatttctga 
cccctccgfca 
gattagtaaa 
cccagtggat 
tattaaaatt 
aactaagcat 
c^taatcttt 
tattgtatgc 



ggagctcagt 
gctcatccct 
tttcttcagc 
tatagcattg 
tatgtctgct 
gtaacggggg 
gtgcttgggt 
acaatct 



ttaagaaaca 
ctacttttcc 
tagaaatgtt 
ttacacttca 
cttatctctg 
cagcaccaca 
ctaataagta 



ctgtgatgtt 
atctaacccc 
acagtctgtt 
tgctcatgtt 
ccttgcaaag 
tctggaattc 
caacccccat 



gtgttttaaa 
agtactggca 
ttttaacaat 
atataaaatt 
aatgaatttg 
aagctgactt 
ggtgtggttt 



840 

900 

960 

1020 

1080 

1140 

1200 

1227 



<210> 138 
<211> 1259 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc — feature 

<223> Incyte ID No: LI : 1169865 . 1 :2000MAY01 



<400> 138 

ccggttgtag 

cgtccctgga 

gaggcccaca 

tgtgctcaca 

tatcagagct 

tggggccatc 

aacaggggag 

gaaggagacc 

cgggctggct 

ccacactcag 

gctgggccag 

cgtcacgtgg 

gcccagaaca 

ggacttctat 

ggaggcccaa 

cctggtgctg 

ttctggccat 

tctgaacaga 

agcttttctt 

caagctctgt 

tagaaattaa 



atgtatctct 
aaatgtgaat 
gtttcagtct 
gtcatcaatt 
gtgatcttga 
aaggcggacc 
tttatgtttg 
gtctggcatc 
aacattgcta 
gccaccaacg 
cccaacaccc 
ctgtgcaacg 
gattacagct 
gactgcaggg 
gagccaatcc 
ggcctagtcg 
gacccccggg 
agaggactta 
ctcacttcat 
ccttatccat 
tgacactttg 



ccaggaagcg 
cccagccatg 
catgtgcctc 
atagacccca 
gagccctctc 
atgtgtcaac 
aatttgatga 
tggaggagtt 
tattgaacaa 
atccccctga 
tcatctgcca 
gggagctggt 
tccacaagtt 
tggagcactg 
agatgcctga 
gcatcatcgt 
cccaggggac 
ggagagatct 
gtgaaaacta 
tacctcaaag 
gtagcactaa 



ctgtgtccca 
ctgattcctg 
cagtcggcct 
caacatgcgc 
cttggctttc 
ttatgccgcg 
agatgagatg 
tggccaagcc 
caacttgaat 
ggtgaccgtg 
cattgacaag 
cactgagggt 
ccattacctg 
gggcttggac 
gacaacggag 
gggctccgtc 
cctgtgaaat 
gaactccagc 
ctccagtggc 
cagtcattcc 
tatggagatt 



ttaaacgcgt 
tccagccatt 
cagttcctca 
cctgaagaca 
ct;gctgagtc 
tttgtacaga 
ttctatgtgg 
ttttcctttg 
accttgatcc 
tttcccaagg 
ttcttcccac 
gtcgctgaga 
acctttgtgc 
cagccgctcc 
actgtgctct 
ctcatcataa 
actgtaaagg 
tgccctacaa 
tgactgaatt 
ttagtaaagt 
atcctttcat 



agattcctgc 

tccagtgcta 

tcactgttcc 

gaatgttcca 

tccgaggagc ■ 

cgcatagacc 

atctggacaa 

aggctcaggg 

agcgttccaa 

agcctgtgga 

cagtgctcaa 

gcctcttcct 

cctcagcaga 

tcaagcactg 

gtgccctggg' 

agtctctgcg 

tgacaaaata 

actccatctc 

gctgaccctt 

ttccaacaaa 

tgagccttt 



60 

120 

180 

240 

300 

3 60 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1259 



<210> 139 
<211> 2604 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<223> Incyte ID No: 



IiG : 337818 . 2 : 2000FEB18 



<400> 139 

cgcggagcct 

gcgctgctgc 

gcccgaggcc 

ctacggcccg 

ttcaccatct 

cgggaggccc 

gaagggactt 

aggaagttta 

ctgatccagg 

ttcgatccct 

ggcctccgct 

accctgctgg 

cggcccctgc 

acagtccggc 

cttgtcgatg 

accaacaaga 



gggagaggag 
tggcgctggc 
acctgccccc 
gggcgctgta 
acctgggacc 
tgggaggtca 
ttgatggcca 
ccatgcttgc 
cggaggcccg 
ccctgctgct 
tctcctatga 
gagtcagctc 
caggccccca 
aggtgcagca 
ccttcctgct 
acatgctgat 



aaggagccga 
gctgctcctg 
cgggcccacg 
ttcagggctc 
ctggcggcct 
ggctgaggag 
tggggttttc 
tctgcgggac 
gtgtctggtg 
ggcccaggcc 
ggataaggag 
ccaggggggt 
caagcagctc 
gcaccagggg 
gaagatggca 
gacagtcatt 



cctgccgaga 
ctgctgacgc 
ccgctaccac 
atgcggctga 
gtggtggtcc 
ttcagcggcc 
ttctccaacg 
ctgggcatgg 
gagacattcc 
acctccaacg 
ttccaggccg 
cagacctacg 
ctccaccacg 
aacctggatg 
caggaggaac 
tatttgctgt 



tggaggcgac 
tggcgctgtc 
tgctgggaaa 
gtaagaagta 
tggttgggca 

ggggaaccgt 

gggagcggtg 
ggaagcgaga 

aggggacaga 

tagtctgctc 
tggtccgggc 
agatgttctc 
tcagcacctt 
cttogggccc 
aaaacccagg 
ttgctgggac 



cggcacctgg 60 
cgggaccagg 120 
cctcctgcag 180 
cggaccggtg 240 
ggaggctgtg 3 00 
agcgatgctg 3 60 
gaggcagctg 420 
aggcgaggag 480 
aggacgcoca 540 
cctcctcttt 600 
agctggtggt 660 
ctggttcctg 720 
ggctgccttc 780 
cgcacgtgac 840 
cacagaat tc 900 
gatgacggtc 960 



67/228 



BNSDOCIO: <WO 01 62927A2_I_> 



WO 01/62927 



• 

rgcxat. 



agcaccacgg tcggcxatac 
gtacgtgagg agctgaatcg 
acccgcctcc cttacaccga 



cccatgggaa 
cagggcacgg 
cacccagaag 
gaggcgttcc 
gcggagtgct 
ccgccggaca 
ttccagctgc 
gcaacttgga 
agggttaatg 
ttccggagtc 
cgcacaccca 
cataaatata 
gatctgcagc 
acatgttcac 
gtgcctggca 
gtctatggcc 
ccgatgtcca 
ggctccccag 



taccccgcac 
aggtcttccc 
agttcaaccc 
tgcccttctc 
tcctcttctt 
ccctgagcct 
aagtccgtcc 
agtggtgggt 
tctccagagt 
tgtcccacgg 
tacacaacta 
gtccatctgc 
ccacacgtgg 
agctcacacg 
cagggaacag 
ccaactcatg 
cacaccccca 
agtatcttcc 



cacctctgca actgcagccc 
aaacacatct gggtctgcga 
ggaggaacac ctgcccaacc 
ctgcccctcc acacacacat 
ccccactaga cccagtggaa 
tgtcagggag tgagccggat 
aagtttgttt ctggccttta 

<210> 140 

<211> 2114 

<212> DNA 

<213> Homo sapiens 



cctcctgctc 
ggagctgggg 
cgcggttctg 
cctcatgcgg 
cctccttggc 
agaccgtttc 
cttagggaag 
caccaccatc 
caagcccacc 
cactgacctt 
gcccaggacg 
gtacactgca 
cccacacgct 
caagggccac 
aatcacaagc 
gagtctggct 
ccctctccat 
catgccccct 
ctccctctct 
accacttgtc 
cactgagaca 
tcagtcaccc 
ttatgcacag 
ccaacacgtg 
acacactcac 
ggggttagag 
ctgacgttcc 
gcct 



ctgatgaaat 
gctggccagg 
catgaggcgc 
accacccgct 
tccatcctgc 
ctggatgcag 
cgtgtctgcc 
ctacaagcct 
gtcagtggcc 
cactccacca 
gtgcctccag 
ggcagccaca 
cacttgactc 
aaagcaactg 
acatagccag 
gtcaccttca 
tcatcgaact 
ccggggtcat 
tggctacacc 
cacacagcta 
cgccgccccc 
ctttttaagc 
agactttgga 
cttatgtaac 
tgatctacag 
accaagtagg 
ttgtgactta 



ccccai 



PCTYUS01/06059 



acccucatgt 
caccaagcct 
agcggctgct 
tccgagggta 
atgaccccaa 
atggacggtt 
ttggagaggg 
tctccctgga 
ttttcaacat 
cgcagaccag 
cctcaacagt 
tttacacgcc 
atgctgctaa 
ctgggttagc 
gtaacccacc 
caagccacag 
tctcagtgtc 
gccacccaga 
actctcccag 
cccacgtacg 
acagaggcac 
accctgattc 
catacgagga 
cacgtggaaa 
cccctgttcg 
ggccagtttc 
agggtccggc 



ccaaaagtgg 

aggggaccgt 

g'gcgctggtg 

caccctgccc 

catcttcaag 

caggaagcat . 

cctggcaaaa 

gagcccgtgc 

tcccccagcc 

atgaaggaag 

gggcatggac 

tgcagttgtt 

gatgcacaac 

tttccacaga 

aactcccctg 

aaacggccac 

cctgtccctg 

gactgtcgct 

cctgtgacca 

acatcgtcct 

agtccccagc 

taccaaatgc 

ccctcagacc 

gcggcccctg " 

gcgtcagagt 

caattcaccc 

ttgggaatta 



1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2604 



<220> 

<221> mis cofeature 
<223> Incyte ID No: 



LI:337818.1:2000FEB01 



<400> 140 

cgcggagcct 

gcgctgctgc 

gcccgaggcc 

ctacggcccg 

ttcaccatct 

cgggaggccc 

gaagggactt 

catgctgatg 

cggctatacc 

gctgaatcgg 

ttacaccgac 

accccgcacc 

aggtcttccc 

agttcaaccc 

tgcccttctc 

tcctcttctt 

ccctgagcct 

aagtccgtcc 

agtggtgggt 

tctccagagt 

tgtcccacgg 

tacacaacta 

gtccatctgc 

ccacacgtgg 

agctcacacg 

cagggaacag 

ccaactcatg 

cacaccccca 



gggagaggag 
tggcgctggc 
acctgccccc 
gggcgctgta 
acctgggacc 
tgggaggtca 
ttgatggcca 
acagtcattt 
ctcctgctcc 
gagctggggg 
gcggttctgc 
ctcatggcgg 
cctccttggc 
agaccgtttc 
cttagggaag 
caccaccatc 
caagcccacc 
cactgacctt 
gcccgggacg 
gtacactgca 
cccacacgct 
caagggccac 
aatcacaagc 
gagtctggct 
ccctctccat 
catgccccct 
ctccctctct 
accacttgtc 



aaggagccga 
gctgctcctg 
cgggcccacg 
ttcagggctc 
ctggcggcct 
ggctgaggag 
tggaggaaca 
atttgctgtt 
tgatgaaata 
ctggccaggc 
atgaggcgca 
accacccgct 
tccatcctgc 
ctggatgcag 
cgtgtctgcc 
ctacaagcct 
gtcagtggcc 
cactccacca 
gtgcctccag 
ggcagccaca 
cacttgactc 
aaagcaactg 
acatagccag 
gtcaccttca 
tcatcgaact 
ccggggtcat 
tggctacacc 
cacacagcta 



cctgccgaga 
ctgctgacgc 
ccgctaccac 
atgcggctga 
gtggtggtcc 
ttcagcggcc 
aaacccaggc 
tgctgggacg 
ccctcatgtc 
accaagccta 
cgggctgctg 
tccgagggta 
atgaccccaa 
atggacggtt 
ttggagaggg 
tctccctgga 
ttttcaacat 
cgcagaccag 
cctcaacagt 
tttacacgcc 
atgctgctaa 
ctgggttagc 
gtaacccacc 
caagccacag 
tctcagtgtc 
gccacccaga 
actctcccag 
cccacgtacg 



tggaggcgac 
tggcgctgtc 
tgctgggaaa 
gtaagaagta 
tggttgggca 

ggggaaccgt 

acagaattca 
atgacggtca 
caaaagtggg 
ggggaccgta 
gcgctggtgc 
caccctgccc 
catcttcaag 
caggaagcat 
cctggcaaaa 
gagcccgtgc 
tcccccagcc 
atgaaggaag 
gggcatggac 
tgcagttgtt 
gatgcacaac 
tttccacaga 
aactcccctg 
aaacggccac 
cctgtccctg 
gactgtcgct 
cctgtgacca 
acatcgtcct 



cggcacctgg 
cgggaccagg 
cctcctgcag 
cggaccggtg 
ggaggctgtg 
agcgatgctg 
ccaacaagaa 
gcaccacggt 
tacgtgagga 
cccgcctccc 
ccatgggaat 
cagggcacgg 
cacccagaag 
gaggcgttcc 
gcggagtgct 
ccgccggaca 
ttccagctgc 
gcaacttgga 
agggttaatg 
ttccggagtc 
cgcacaccca 
cataaatata 
gatctgcagc 
acatgttcac 
gtgcctggca 
gtctatggcc 
ccgatgtcca 
ggctccccag 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 



68/228 



BNSDOCID: <WO 0162927A2_I_> 



WO 01/62927 



PCT/US01/06059 



agtatcttcc 
actgcagccc 
gggtctgcga 
ctgcccaacc 
acacacacat 
cccagtggaa 
tgagccggat 
ctggccttta 



cactgagaca 
tcagtcaccc 
ttatgcacag 
ccaacacgtg 
acacactcac 

ggggttagag 

ctgacgttcc 
gcct 



cgccgccccc 
ctttttaagc 
agactttgga 
cttatgtaac 
tgatctacag 
accaagtagg 
ttgtgactta 



acagaggcac 
accctgattc 
catacgagga 
cacgtggaaa 
cccctgttcg 
ggccagtttc 
agggtccggc 



agtccccagc 
taccaaatgc 
ccctcagacc 
gcggcccctg 
gcgtcagagt 
caattcaccc 
ttgggaatta 



cacctctgca 
aaacacatct 
ggaggaacac 
ctgcccctcc 
ccccactaga 
tgtcagggag 
aagtttgttt 



1740 
1800 
1860 
1920 
1980 
2040 
2100 
2114 



<210> 141 
<211> 773 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_f eature 
<223> Incyte ID No: 



LG : 241577 . 4 : 2000MAY19 



<400> 141 

ccaggtcgcg 

tgctgaggaa 

cctggactcc 

tgcccccttt 

atgatcccac 

ggtgggggga 

gaagagccct 

ggaagctgga 

gagacctttg 

ttcttcttca 

ccaaagccga 

atgaaggtga 

aagaagtgac 



ctcccactgc 
aggtggaggt 
ttcctccctg 
gcccttccat 
tccttggtcc 
agatgtcccc 
tctgcacaga 
cccagctgac 
cggagatgaa 
ttggattcgc 
tcaccttgac 
atcctgtgca 
ttgcatcccc 



cgagcccgcg 
ggaagacgag 
cctaactcca 
gccaccctgg 
aaggtcaagt 
ctacaccaac 
actcaacgct 
ccacgccgaa 
ccgtcgctcc 
agctctggtg 
ggacgagcgg 
gggcctggcc 
agctgtctcc 



agatgctccc 
ggatgcacag 
gagatagggg 
gcctgttcct 
ttctccagga 
tgctatgccc 
gaggagcagg 
aaggtggcct 
aatgagtgga 
atttggtggc 
aaagcccagc 
tcccgctggg 
ctggagtccg 



cagagctgcc 
ctcagaaggc 
caggggtccc 
accacacttc 
aatggagtgc 
agcgctacta 
ccctgaagga 
tgtaccggct 
agacagtgat 
agcgggtcta 
agctgcagcg 
actatgagaa 
ccctggctgg 



tggagcttgg 60 
accagtgaga 120 
ctctgcaggc 180 
ccatgttccc 240 
gcgggcccgt 3 00 
ccccatgcca 3 60 
gaaggagaag 420 
ccagttcaat 480 
gggttgtgtc 540 
. cgtatttcct 600 
catgctggac 660 
gaagcagtgg 720 
gac 773 



<210> 142 
<211> 296 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> mis cofeature 
<223> Incyte ID No: 



LG : 344786 . 4 : 2000MAY19 



<400> 142 

cccatcttgt 

tccttgcttt 

ccaccagtaa 

agcttgagga 

aggcattttt 



ggagcgtact ttcattttca atagagcttt gcttttgttg cttcattctt 60 
gtgtgttttg tccaattctt tgttcaagac accaagaacc tagacaccct 120 
catattctag tgagccagcc aggagtttat tcaggatgat aacatggaga 180 
aattattgaa aaataccctc gtgccttccc tttctggatt gggccctttc 240 
ctgtatctat gacccagact atgcaaagac acttctgagc agaaca 29 6 



<210> 143 
<211> 1171 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 4143 07 . 1 : 2000FEB01 



<220> 

<221> unsure 
<222> 574, 583 
<223> a, t, c, 



g, or other 



<400> 143 

cgggatggga aaactatgcc tggggccgac gctctgcccg gctgctgccg ctgaggaaag 60 
ccgggacgcg gagccccgcc gagagcttct ttgctccgga cgcccctgga cgtggcgggc 120 
agccgcgagg gtaaccacca tgatcccctg ggtgctcctg gcctgtgccc t<:ccctgtgc 180 



69/228 



BNSDOCID: <WO 0 1 62927A2_I_> 



WO 01/62927 



• 

rcttgg 



PCT/US01/06059 



tgctgaccca ctgc€Tggcg cctttgctcg cagggacttc 
ggtctgcagc ctgcctggcc cccagggccc acccggcccc 
aggaatgatg ggacgaatgg gctttcctgg caaagacggc 
ccggggggac agcggagagg aaggtccacc tggccggaca 
accaaagggc aaagccgggg ccattgggcg ggctggcccc 
cggtaccccc gggaagcatg .gcacaccagg caagaagggg 
gccaggcctc, ccaggcccct gcagctgtgg cagnggccat 
ggcagtgacc aagagctacc cacgggagcg gctgcccatc 
gaacgagggt ggccactaca atgcttccag cggcaagttc 
ctactacttc acctacgaca tcacgctggc caacaagcac 
caacggccag taccgcatcc ggacctttga tgccaacacc 
aggctccacc atcctggctc tcaagcaggg tgacgaagtt 
agagcagaac gggctcttct atgaccctta ctggacagac 
aatctatgcc gaccaggatg accccaacga ggtatagaca 
cagggaacaa gcttctggac ttgggcttac agagcaagac 
gtggggggtc gagtgagcgg ttctagcctc aggctcacct 
cttcattaaa tccaaagctt tttattcatc c 

<210> 144 
<211> 372 
<212> DNA 
<213> Homo sapiens 



cggaaaggct 
ccaggagccc 
caagatggac 
ggtaaccggg 
cgtggcccca 
cccaagggca 
acnaagtcag 
aagtttgaca 
gtctgcggcg 
ctggccatcg 
ggcaaccacg 
tggctgcaga 
agcctcttta 
tgccacggcg 
cccacaactg 
cctctgcctc 



cccctcaact 
cagggccctc 
acgacggcga 
gaaagccagg 
agggggtcaa 
agaaggggga 
ctttctcggt 
agattctgat 
tgcctgggat 
gcctggtgca 
atgtggcctc 
tcttctactc 
cgggcttcct 
gtcctccagg 
taggctgggg 
tttttttccc 



240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1171 



<220> 

<221> misc_feature 
<223> Incyte ID No: 



LI : 202943 . 2 :2000FEB01 



<400> 144 

gtgtgaaagc 

ggctccagga 

tcacattttc 

agtctggaga 

cgtttcttat 

tatcaatgtt 

cacacacaca 



cacttattat 
aatccggaaa 
gtgtcaggaa 
atggaatcat 
gtttggcaat 
ctttgtattt 
cc 



tccaccaaaa 
atggactcct 
ggataccagt 
ctaataccat 
cctagaaagg 
gaactgaggt 



ctgtctgttt 
caggtgagat 
tgatgggagt 
attgtaaagg 
taaggcgtag 
attaggctca 



ctagaaccta 
ttatacagta 
aaccaaaatc 
tatgtttagt 
acatatgaag 
tgtttgaaca 



taaaatgtaa 60 
ggtgccgaag 120 
acatgtttgg 180 
aaatttacca 240 
tgttatgttg 300 
tgaccccaca 360 
372 



<210> 145 

<211> 337 " • 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 246194 . 2 : 2000FEB01 
<220> 

<221> unsure 

<222> 73, 82, 105, 126, 154, 334 
<223> a, t, c, g, or other 



<400> 145 

tcctcaaccc 

ccagccacca 

gttggngccc 

tcgggctcca 

gccgggcctg 

caacacccaa 



accaccgccg 
ggngctcctg 
ccactcctgc 
ctgccaggtc 
cagcccagcg 
taaagttcct 



cagcactggt 
cnctgctgcc 
atccaccacc 
agatgctgct 
tcatggagga 
ttttaacaca 



gcccccaccc 
tccgccgcac 
tgcncagtcc 
gagcgggggt 
cgacatcctc 
aaanaaa 



caggcctccc 
caggncctgg 
tggcccgcac 
ccccggggcc 
atggatctca 



tccaccacct 60 
ggcagcccca 120 
aacttccccc 180 
cggtccccca 240 
tctgaatccc 3 00 
337 



<210> 146 
<211> 866 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> Incyte ID No: 



LI : 815961 . 1 :2000FBB01 



70/228 



BNSDOCID: <WO 0162927A2_I_> 



I 



WO 01/62927 



PCT/US01/06059 



<220> 

<221> unsure 

<222> 741, 759-760, 763, 788, 837 
<223> a, t, c, g, or other 



<400> 146 

gaaaggatgt 

aatgtgtggt 

gggttctggt 

cttcaggata 

taaatgagct 

gggtggaccc 

ggatggaggc 

gagtcaccaa 

atggggttgg 

agatgggact 

tggtggccga 

tgatgttggt 

gtaaggatgt 

tggtcagnat 

gaggaatggt 



tgcagtcata 
gaagggtatg 
gcctgtactg 
gtgtgtggag 
tgcggatggg 
tgtgaccttg 
agaagtggac 
ggaggtggag 
ataggtagtg 
gctccctgta 
ggtggtgtgg 
ggtagaagtt 
agaggttttg 
cattggtggg 
gcctgttggc 



ggacctgtgg 
gttagcctgc 
gtgtggttgg 
gaagtgtgtg 
ctggtcccac 
agcgttgtta 
atctgtgcat 
aaaggtggaa 
gtggtctgaa 
ggtggggaat' 
gccacagggg 

ggggtgactt 

nctgtgctaa 
gctgtgtggg 
attgag 



aagagatggg 
tgctggtggc 
gggtgatggt 
aatgtaagga 
tggtggtcac 
gtggaggaat 
gggtaggggt 
tgtgagtggg 
aggatggtgc 
gtgtggtgaa 
ttctggtgcc 
caggatagtg 
atgagcttnn 
tggaccctgt 



actgctgcct 
cgaggtggtg 
ggtggtagaa 
tgtagaggtt 
tgtgattggt 
ggtgcctgtt 
gatgaccttg 
aagtgtggtc 
agtcatagga 
gggtatgggt 
tgtactggtg 
tgtggaggaa 
ggnttggctg 
ggccttcagc 



gtagttgggg 60 
tgggccacag 120 
gttggggtga 180 
ttgcctgtgc 240 
ggggctgtgt 300 
ggcgttgagt 360 
tgagtacttg 420 
tgagggtgtg 480 
cctgtggaag 540 
agcctgctgc 600 
tggttggggg 660 
gtgtgtgaat 720 
gtcccactgg 780 
gttgtcngtg 840 
866 



<210> 147 

<211> 1115 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 
<223> Incyte ID No: 

<220> 

<221> unsure 
<222> 776 
<223> a, t, c, 



LG:12 0744.1:2000MAY19 



or other 



<400> 147 

cgctgcaggg 

ctcgggttgg 

gctgcgacgg 

aacgaaaaca 

cttcagaatt 

acagatttag 

gtgaaaaagt 

gttggagtgg 

tcaggagaac 

acagggaagg 

actaagatag 

caagcagcaa 

gatgccgaac 

gactatattg 

gtctgtccaa 

gtttagatgt 

gatatgaagt 

gtcttcctcc 

atttatggag 



agaacggact 
gcttgccact 
caaccccgag 
gaaacatggc 
ctgtgttagc 
ttttcatctt 
ggcttgtcaa 
ttcaatatag 
atttgacggc 
ccatccagtt 
cagtggtact 
gagatagtaa 
ttagagctat 
caatatccaa 
cacgaattcc 
aaataaaaag 
aacatcaaaa 
atcatatgta 
aatattaact 



ccgggcggag 
gcctggaaca 
cctggacact 
tcactatatt 
tgaagatggg 
agatggctct 
tatcacaaaa 
tgactaccct 
agcagtggaa 
tgcgctcgat 
tacggatggc 
gataacatta 
tgccaacaag 
aataagggaa 
agtggcagct 
gttaagaaaa 
gttgatttat 
tttgtgtcta 
attgatggaa 



ggcagccaat 
tacctgtccc 
gcgccaggaa 
acatttctct 
gaagtaagat 
tatagtgttg 
aactttgaca 
gtgctggaga 
tccatactct 
tacctttttg 
aaatcccaag 
tttgctattg 
ccttcgtcta 
gtgatgaagc 
cgtgatgaaa 
gaatacagct 
cagaactcac 
ctcaaagatt 
ggcca 



ccgtttcagc 
cctggcgcaa 
tcctaaaacc 
gcatggtttt 
caagttgtcg 
gcccagaaaa 
tagggccgaa 
ttcctctcgg 
acttaggagg 
ccaagtcctc 
atgacgtcaa 
gtgttggttc 
cttatgtgtt 
agaaactttg 

ggggatttga 

ttcaccaaaa 
aagcaatgtt 
taaagtcaag 



gcaggtcttg 
cactcagctg 
aaaatattag 
ggtgctgctt 
tactgctccg 
ctttgaaata 
gtttattcaa 
aagctatgat 
aaacacaaag 
acgatttctg 
ggatgcagct 
agaaacagaa 
ttatgnggaa 
tgaagaatct 
tattctttag 
aagataaaag 
ttcccagaag 
aaaatttggg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1115 



<210> 148 
<211> 2448 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<223> Incyte ID No: 



LI : 757520 . 1 : 2000MAY01 



<400> 148 



71/228 



BNSOOCID: <WO_0162927A2_I_> 
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tgagcctcgc actctgccgc ccgcaccacc ttccgctgcc tctcagactc tgctcagcct 60 
cacacgatgt cgtgccgctc ctacaggatc agctcaggat gcggggtcac caggaacttc 120 
agctcctgct cagctgtggc ccccaaaact ggcaaccgct gctgcatcag cgccgccccc 180 
taccgagggg tgtcctgcta ccgagggctg; acgggcttcg gcagccgcag cctctgcaac 240 
ctgggctcct gcgggccccg gatagctgta ggtggcttcc gagccggctc ctgcggacgc 3 00 
agcttcggct accgctccgg .gggcgtgtgc ggacccagcc ccccatgcat cactaccgtg 3 60 
tcggtcaacg, agagcctcct cacgcccctc aacctggaga tcgaccccaa cgcacagtgc 420 
gtgaagcagg aggagaagga gcagatcaag tccctcaaca gcaggttcgc ggccttcatc 480 
gacaaggtgc gctt.cctgga gcagcagaac aagctgctgg agaccaagtg gcagttctac 540 
cagaaccagc gctgctgcga gagcaacctg gagccactgt tcagtggcta catcgagact 600 
ctgcggcggg aggccgagtg cgtggaggcc gacagcggga ggctggcctc agagctcaac 660 
catgtgcagg aggtgctgga gggctacaag aagaagtatg aagaggaggt ggccctgaga 720 
gccacagcag agaatgagtt tgtcgttcta aagaaggacg tggactgtgc ctacctgcgg 780 
aaatcagacc tggaggccaa tgtggaggcc ctggtggagg agtctagctt cctgaggcgc 840 
ctctatgaag aggagatccg cgttctccaa gcccacatct cagabacctc ggtcatagtc 900 
aagatggaca acagccgaga cctgaacatg gactgcatca tcgctgagat caaggctcag 960 
tatgacgatg ttgccagccg cagccgggcc gaggctgagt cctggtaccg tagcaagtgt 1020 
gaggagatga aggccacggt gatcaggcat ggggagaccc tgcgccgcac caaggaggag 1080 
atcaacgagc tgaaccgcat gatccagagg ctgacggccg agattgagaa tgccaagtgc 1140 
cagcgtgcca agctggaggc tgctgtggct gaggcagagc agcagggtga ggcggccctc 12 00 
agcgatgccc gctgcaagct ggctgagctg gagggc£ccc tgcagaaggc caagcaggac 1260 
atggcctgcc tgctcaagga gtaccaggag gtgatgaact ccaagctggg cctggacatc 1320 
gagatcgcca cctacaggcg cctgctggag ggcgaggaac acaggctgtg tgaaggtgtg 1380 
ggctctgtga atgtctatgt cagcagctcc cgtggtggag tctcctgtgg gggcctctcc 1440 
tacagcacca ccccagggcg ccagatcact tctggcccct cagccatagg cggcagcatc 1500 
acggtggtgg cccctgactc ctgtgccccc tgccagcctc gttccfcccag cttcagctgc 1560 
gggagtagcc ggtcggtccg ctttgcctag. tagagtcatg gagccagggc ttcctgccaa 1620 
gcacctgcct gcctgcatca ctgcactgaa tggcatgtga atggaaaatg tgtgcttgct 1680 
tccagaatct tctggatgtt cctacagagg ggaagaccta cagagggaaa gaccctcggg 1740 
ccgctcccct gcgccttttc atgctaggga gatgcatcct agttgtcctc ctggcagctg 1800 
ttttcagagg cattcccagc ccttcactta actcctactt agctccaaaa tacctgtatc 1860 
caatttgtat tattccccca gctctcaggg acaagaccag tcccccagcg tggtggtcag 1920 
cacggaagct ccaccttctg ggtggaggcg ccatcctaac catccagcca ggccacccac 1980 
aacccgagaa tcagggagaa agtccctccc cagcagcccc ctcctcctgg ctgggaagaa 2 040 
tggtccccca gcaagcactt gcctgttcat tcccgttcat gttttgcttc tctctcagac 2100 
tgccttcctg cttctgggct aacctgttcc aagccaggct cctcaatgtg acctcgcagt 2160 
tgagaagccc attatcgtgg ggcatccttt tgcctacagc ccctggttag ggcactttgg 2220 
acaggtcttg ctat'tcagtg aacctttgta catttcaaag aagactccat ggctgctcca 2280 
gatgccccct tgctgggtgc aggtggggac tgtccaatgc agagctggcg ggacagagag 2340 
ttaagccact tcctgggtct ccttcttatg actgtctatg gggtgaattg ccttctgggg 2400 
ttgtctcgat ctgtgtttca ataaatgccg ctgcaatgca aaaaaaaa 2448 

<210> 149 
<211> 650 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 160570 . 1 :2000FEB18 
<220> 

<221> unsure 
<222> 609 

<223> a, t, c, g, or other 
<400> 149 

gcttccccgg cggctcccgg gcgctgactc tcccgggctt ttccgccttt ctcgctcccg 60 
ccccgccggg cggttcggcg gccggtcccc ggaccccgcc gggagctgct cgtctcaccc 120 
tcgccacggc gcggacaata aattaatgca tcgcgcttga gtggaggggg cggctgggcg 180 
cctcggtggt agctttctct cctggctgga gacgaccaca accgacatgg gctgtttctg 240 
cgctgttccg gaagaatttt actgcgaagt tttgctcctg gatgaatcca agttaaccct 300 
taccacccag cagcagggca tcaagaagtc aacgaaaggt tccgttgtcc ttgaccacgt 3 60 
attccatcac gtaaaccttg tggagataga ttattttggg ctacgttact gtgacagaag 420 
ccatcagacg tattggctgg atcctgcaaa aacccttgct gaacacaaag aactgatcaa 480 
cactggacct ccatatactt tgtattttgg tattaaattc tatgctgaag atccatgtaa 540 
acttaaagaa gaaataacca ggtatagtat tgactttgtt tttgaacaga tacatgcatt 600 
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BNSDOCID: <WO__0162927A2_I_> 



WO 01/62927 



PCT/US01/06059 



gcgtatacna aaagctttat ttaaaacaaa ttaaaattgg gctaaacgtt 

<210> 150 
<211> 257 
<212> DNA 
<213> Homo' sapiens 



650 



<220> 

<221> misc_f eature 
<223> Incyte ID No: 



LI: 3503 98. 3 :2000FEB01 



<220> 

<221> unsure ■ 
<222> 22, 117, 120, 220, 246 
<223> a, t, c, g, or other 

<400> 150 

cagctgctcc tgacactccc anaactgtcg ccagcgggga ggggaggcag tagggactgc 60 
cgaagcctgg ggccctggtg ggtgacaggg cagcaccagc ctgggtccag ccacctntcn 120 
cacaggtgaa ccgcttccac aagatccgga accgggccct cctgctcaca gaccagcacc 180 
tctacaagct ggaccctgac cggcagtacc gggtgatgcn ggccgtgccc cttgaggcgg 240 
tgactngctg agcgtga * 257 

<210> 151 

<211> 1554 , 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> mis c_f eature 

<223> Incyte ID No: LI : 221285 . 1 : 2000FEB01 



<400> 151 

gcctggctgc 

aggagaaaca 

ccaacaccag 

cgcaggcctc 

caaagcgcag 

cttgtgccgc 

cagacagaac 

ctgggagcga 

ctcccttgtt 

tctccgggtc 

gaggcagtca 

gcacctacac 

acagcctgac 

gggtaattcg 

ctctagagct 

aagagcagct 

aaacagaatt 

aaacagatgc 

aagatgtggc 

cagtgttaga 

aagaagagct 

aacgtgctga 

ggcgtaagaa 

ccgcccgagc 

tcagggatag 

catggctaat 



caggggcgtg 
gccgaggaca 
ccctgcaaag 
aaggatgacg 
gcgcacctac 
agtccttccc 
cctgggttgc 
gcccagtata 
gtcgcctctc 
ccacaacacc 
gcccccggac 
ctacaccagc 
gcagccagat 
aggtaacact 
ccagagacaa 
cagaccacaa 
ataccttgaa 
atttttggac 
cacccaaata 
agttttggtg 
ggctaacctg 
agttcaacga 
acagcagtgg 
atttgcacag 
tggctacttt 
gaatgaagtt 



ttatcgcggg 
ccctggaaga 
ccacattcca 
gtcaag'ccag 
ctgggtggcg 
gggaaacctg 
tgggcaaccg 
gctcccacct 
ctcgcgtcgc 
ttagcgccca 
atggcctcag 
cggccccgag 
gaagaaccta 
tatgcactcc 
cgggaggcta 
acacctgaac 
gaaattgctg 
agaccaccaa 
ctagaaggag 
gggaagacaa 
cgggccagtc 
cttgaagagc 
gaaataatgc 
cgttacctgg 
tatgatccca 
gaaaaaacca 



ggcagcctgg 
gcaggagcac 
aagccgggcc 
ccaaggccgc 
cggctgggcg 
gggaccggaa 
atgcggcccc 
cc tgcgctgg 
ggaatccttg 
cctgcttcaa 
cgctgactga 
cactgccctg 
tgcattatgg 
agacagggcc 
ggaagagggc 
ctgtggaagg 
atcgcataat 
caccactctt 
agctctttga 
ttgagcagtc 
agcgtgagta 
aagagaggcg 
acaagcacaa 
ctgaccttct 
ttgaaagaga 
tggaatatag 



gagcgctgct 
gtgggcctgc 
aaggacagcc 
ctccctagcc 
gagtcaagag 
ctgcccggag 
cgcagctggt 
taacctcccg 
cccttggcac 
agccaagctc 
tcgcacctct 
ccagcgcagc 
aaacataatg 
actgctcgga 
tcttgccaga 
cagaaagcat 
agaagttgat 
tattcctgcc 
ctttgatctt 
tcttctggaa 
tgaagaacta 
acaccgagaa 
cgagacatca 
cccgtctgtt 
tattgagata 
catggtggga 



gcgccgcggc 
gagaatggcg 
accgttgccc 
aggaacctcg 
ccggaagtcc 
ttcccgcccc 
ctctgtggtg 
tcccgcccgc 
tacttacatc 
caccgaaagc 
cgggccccga 
cgttaccggg 
tatgacagaa 
cggcctgatt 
aaacaagccc 
gtcgatgtgc 
atggaatgcc 
aaaactggca 
gaagttaaac 
gtaatggaag 
cggaatagtg. 
gaaaaagaac 
caaaaaatcg 
tttggcagcc 
ggatttcttc 
agaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1554 



<210> 152 
<211> 623 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
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<223> Incyte 11^^: LI :401605 . 2 :2000FEB01 



<400> 152 

ctggcagggc tggaacatga agaaagagaa gaattttatt tatctacgcc agaaaactac 60 
cactacttga atcagtctgg atgtgtagaa gacaagacaa tcagtgacca ggaatccttt 120 
agggaagtta ttgtaagtta cttttaaatg ttttccctat taagatatca gataccttat, 180 
taagagtaac agatttcatt ttcggtttca aaagattttt ttaaatagaa atagaagtaa 240 
ttccatttgg accagtgggt gagaacaggt atttccttct tgttccaagt catgaacttg 300 
ttccaagttc taggctgaaa ggggtttgag gactcttctt tccttagaag atgagcgaat 360 
gtgtcattca gggcagatgg gcagtttact tgggacagtg tgttcagaaa gtgtaccttc 420 
cacccccaaa aaacccccca aatcatgggc ttcactctgg aaccagcaga tcctccattt 480 
tggggcttac aagtaagatt tctttgaaaa acaaattcta tagctaaaaa caaatctgcc 540 
agccattggc ccaggtgact gttgttgcag aattttggct ctttgttcag ctaaaacctg 600 
ggggtcttgt cacacgatca gga 623 

<210> 153 

<211> 1038 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc__feature 

<223> Incyte ID No: LI : 329017 . 1 : 2000FEB01 
<400> 153 

ctgggctgga ggctctagca gttgttgccc acctggctgt ccatggcaaa agtgggtggg 60 
gtcatgtgca gcctcctgcc acctgagtgt ttcccgggac atcaggaggc tgtatcctgc 120 
agctgaattt acacagaagt ggaacaactg ggctggaagc tctatggcga caaattggct 180 
acttcaagtg gtgacactac agttaaatta tgggatctat gtacaggcga ttgcattttg 240 
acctttgaag gacacagccg cgcagtgtgg tcctgcacat ggcactcctg cggcaatttt 300 
gtggcttcct cctcactgga taaaactagc aaaatttggg atgttaatag tgaaagatgc 3 60 
agatgtactt tgtatggaca tacagattct gtgaacagca ttgagttttt tcctttctcc 420 
aatactcttc tcacaagctc tgcagacaag accctgtcta tatgggatgc aagaacaggt 480 
atatgtgagc agtcacttta tggtcacatg cattctatca atgatgccat ttttgatccc 540 
aggggtcaca tgatagcatc ctgtgatgcc tgtggggtta caaagctgtg ggactttcgg 600 
aagctgttac caattgtgtc catcgatata ggtccaagtc ctggcaatga ggtgaatttt 660 
gattcatcag gtcgagtttt agctcaggca agtggcaatg gtgttatcca tttgctagat 720 
cttaaatctg gggagattca caaattgatg ggccacgaaa acgaggcaca cacggttgtg 780 
ttttctcacg acggggagat tctcttttct ggaggctctg acggcacagt tcgaacgtgg 840 
tcttgaccgt cagcacatcc cgctgcagag ggcattccct ttaaggcttg aaaatgcctc 900 
ctgtagtccc aaaccagcaa agaactggtt aaatgtgcca gcaggtaaca tttacagtct 960 
gctttaaaac atgctttgga catatatttt agtgctcata ctgttactaa taaaaaaata 1020 
actttatgct caaaaaaa 1038 

<210> 154 
<211> 737 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 401322 . 1 : 2000FEB01 
<220> 

<221> unsure 

<222> 601, 625, 633, 698, 700, 711 
<223> a, t, c, g, or other 

<400> 154 

gcgacccaat gggagtcccc gccaggagac acaaccggcg gctctccggg gatttataag 60 

gtcccgccag caaccgcgcg ttgggattcg tggtgttgct ggagacccgt atggtgtctg 120 

cagtcagcga ccctccaact tcaactactg gatcaaccat gtgtgagtgc atctccatcc 180 

atgtgggcca ggccggagtg cagatgggca atgcctgctg ggagctgtac tgcctggagc 240 

acgacatcca gcccagtggc accatgccca gccacaaggc cctggggagc agtgataact 300 

ccttcaacac cttcttcagg gagacccagc ctggcaggca tgtgtcctgg gctgtctgtg 360 

gacctggagc ctgctgtcat aggttggcat caactaccag tcccccacag tggtgcccgg 420 

gggtgctgta gccaaggtgc agcgggcagt ctgcgtgcta aacaatacca cagccatcac 480 
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tgaggcctgg ggcccgcctc aaccaaaagt ttgacctgat gtatgccaag cgggcattta 540 
tgcactgtta tgtggacagg ggcatggagg aaggtgagct ctctgggcct gagaggacct 600 
ngctgccctg gagaaggatt atgangtagg canagaatct ttggacagcg aggatgagca 660 
tgaggagttc taggggtcct ggcacctgta iacacctgntn tctcgcacca nagttgtgtg 720 
tgtatctcat tccactg 737 

<210> 155 

<211> 2275 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LG: 403409 . 1 : 2000MAY19 



<400> 155 

gtggccgcgg 

aagtgaaagt 

aatgtgtggt 

ccaaaggaga 

tggatgaatc 

agaatatcct 

ctggctctgg 

gactttgcag 

aagtagaagt 

aaggaagccg 

gactttctaa 

acaaatctcg 

ttttcaaaat 

aagtgggcaa 

ctgcagggga 

tggttatctc 

atcgtgactc 

ccatggtggc 

ggtatgcaga 

cccgaattat 

cagaggcaat 

aggaaatgac 

gacagaaaca 

atgataaatg 

actatttaaa 

gcatgggaat 

tgactcctca 

agctacacca 

ctaaaaagaa 

agaatagttc 

ttaactttaa 

atccgatgca 

cactggagcg 

ctcctgagaa 

agcaacgctt 

ggctgaggga 

aggagctgga 

atgccaacag 



cagacggaag 
ggcggtgcgg 
ggatgtggat 
tgcccggggc 
tgtcaaagaa 
gcagaatgct 
aaaatcttat 
tggactcttt 
gtcctacatg 
tcagacgttg 
actggctgtc 
cacagttgct 
caccctcaca 
actcagcctg 
caggctgaag 
agctcttgca 
agttctcact 
tactgtgagt 
tcgagccaag 
ccgggatctc 
gaaatctcca 
tgtgacctgg 
gcttgagagt 
cttccttgtg 
ggaacataca 
tcttcctgaa 
gaagaacacc 
tggggacagg 
aaagaaagca 
tgagcagctg 
ttacgaatac 
gtccatatta 
ccagaggctt 
gcagaactgc 
aagacagtgg 
acaaattgtt 
taaaagaaca 
gaagcgaggc 



ccgaacgagt 
atacgaccca 
gcaaacaagg 
cagccgaagg 
aagtatgcag 
tttgatggct 
accatgatgg 
gaacgaactc 
gaaatttata 
aaagtcagag 
acaagctaca 
gcaaccaaca 
catactctct 
gtggatttag 
gaagggagca 
gatcagagtg 
tggctgctca 
cctgcagctg 
cacattgtaa 
cgggaagaag 
gagctaaagg 
gaggagaaat 
cttggaatat 
aatctgaatg 
ttgatagggt 
cactgtatta 
agaacatttg 
atattatggg 
gaacgagagg 
gatgtagacg 
gcacagatgg 
aacagcttag 
atgtatgagc 
cggagcatgg 
gctgaggaga 
aaggccaatc 
gaatacaaag 
tctcttctta 



tcctcggcgg 
tgaaccggcg 
ttattcttaa 
, tgtttgctta 
gtcaagatat 
acaatgcatg 
gcacagctga 
agaaagagga 
atgaaaaagt 
agcatagtgt 
aggatattga 
tgaacgagga 
acgatgtgaa 
ctggcagtga 
acattaacaa 
ctggcaaaaa 
aagacagcct 
ataactatga 
accacgctgt 
ttgagaaact 
accggctgga 
taaggaaaac 
ctcttcagtc 
ctgacccagc 
cagcaaattc 
tagacatcac 
taaatgggtc 
gaaacaatca 
atgaggacca 
gagactcctc 
aggtcaccat 
aacaacagca 
acgaattgga 
acaggttttc 
gagaagcaac 
tattggtgag 
ttaccctaca 
gtgagcctgc 



ctgcaggatg 
agagactgac 
tcctgtaaat 
tgatcattgt 
tgttttcaag 
tatctttgcc 
ccaacctgga 
aaatgaagaa 
tcgagacctt 
gttgggacct 
gtcgttgatg 
gagtagccga 
gtctgggaca 
acgagcaacg 
gtccctcaca 
caagaataaa 
cgggggtaac 
tgaaaccctc 
ggtgaatgag 
ccgggagcag 
agaatctgag 
ggaggagatt 
ttcgggaatc 
tctgaatgag 
ccaagatatc 
gtcagaaggc 
atctgtctcc 
tttcttcaga 
ggatccctcc 
cagcgaggtg 
gaaggccctg 
tgaagaagaa 
gcagctccgg 
tttccactcg 
gttgaataac 
agaagctaat 
gattccagcc 
aatccaggtg 



ggggactcca 
ttgcatacca 
acgaatcttt 
ttctggtcta 
tgccttggag 
tatggacaga 
ttaatcccaa 
cagagtttta 
cttgatccca 
tatgtcgacg 
tctgagggta 
tcccatgcag 
tctggagaga 
aagacaggc£ 
accctcggtc 
tttgttccat 
agcaagaccg 
tcaactctgc 
gaccctaatg 
c tgaccaaag 
aagctaatcc 
gcacaggaac 
aaagttgggg 
cttctggtgt 
caactgtgcg 
caggttatgc 
agtccaatac 
ctcaatttgc 
atgaagaacg 
tccagtgaag 
ggcagcaatg 
aaacgatctg 
agaaggctgt 
cccagcgctc 
agcctgatga 
tacattgctg 
tccagcctgg 
agaag 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 * 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2275 



<210> 156 
<211> 488 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> mis cofeature 
<223> Incyte ID No: 



LG : 233933 .5 : 2000MAY19 



<400> 156 

ctccacgcca gcacttttgc atgcagtact cccttcaggt cgtcacagcc acctgcaagt 60 
ttgggatgaa tgcgcttctc ctgtctgcct ggttcggcca cttacgaatc ctccagatct 120 



75/228 



BNSOOCID: <WO_0162927A2_I_> 



WO 01/62927 ^ 

tggtaaactc aggg^Kaag atccactgtg agagcaagga 
tggctgctgg tcggggccat atggctgtgc tgcagcgact 
tggaggagca gaatgcggaa ggtctgactg ccctgcattc 
ctgactgtgt gcagctcctc ctcagggctg ggagcaccgt 
acctaagctg ccttcactat gcagccctca gtggctcgga 
tccacgcagg aggctgcgcc aacgtggttg atcatcaggg 
ctgtgagg 

<210> 157 
<211> 5106 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc__feature 

<223> Incyte ID No: LI : 29 0344 . 1 : 2000MAY01 
<400> 157 

gggaagtggg agagacagag ggagagtgtg catacagcag ctgtcgggca tccctgtctc 60 

agggacttct tcgctgattc acagaggcag cccagccccg gcctcccagc ttacagctgc 120 

cgcctgcttc tcagaacctg acggaatcag aagctacatg atattcgccg tgtgagtggc 180 

tatcgtaata gataacattt gaatatctac taaatgccag aaactatgaa ttaactcctc 240 

tcaacaaagc taagaggtgc ttgatgatca atagttggtg cactgcctat cctcatcccc 300 

cccatgagcc ttggttttta tgcaacctat ggtcatgcag ggatgccctt acaccctccc 360 

acgatgtcat gactggcagg cagctgacca gttccatcat agcagcagcc tccgaagcac 420 

ctgcccccac cctcaggtta gagctgctgt caccagccct gcacctcctc aagatggtgc 480 

tggggttccc tgcctaagcc taaagctgtt gaacgggtct gttggtgcct ctggacccct 540 

ggaaccacca gccatgaatc tgtgttggaa tgaaataaaa aagaagtctc acaacctccg 600 

cgctcgccta gaggccttct cagaccacag tggaaagctt cagctccctc ttcaagagat 660 

tattgactgg ctcagccaaa aggatgagga gttgtcagct cagctgcccc tacaggggga 720 

tgtggccctg gtgcaacagg agaaggagac acatgcggcc tttatggaag aagtcaagtc 780 

tcgggccccc tacatctatt ctgtgctgga gtcagctcag gccttcctgt cccagcaccc 840 

atttgaggag ttagaggagc ctcattctga gagcaaagat acctccccga aacagcggat 900 

ccagaatctc agccgctttg tatggaagca ggcgacggtg gccagtgaac tgtgggagaa 960 

gttgacagcc cgctgtgtgg accagcaccg tcacattgag cggactctgg agcagctctt 1020 

ggagattcaa ggggcaatgg aggaactaag cac tact eta agecaagctg agggagtccg 1080 

agecacttgg gageccattg gggatctctt cattgattca ctcccagagc acatccaggc 1140 

tattaagctg ttcaaagaag aattctcccc catgaaagat ggagtaaagt tggtgaatga 1200 

tctggcccac caacttgcca tttctgatgt gcacttgtca atggagaatt cccaggccct 1260 

ggaacagatc aacgtccgat ggaaacaact acaggegtea gttgatgaga ggcttaagca 1320 

gctccaggat gcccaccggg actttgggee tgggtcacag cactttctct cctcctctgt 1380 

ccaggttccc tgggaaagag caatttcacc caataaagtt ccctactaca tcaaccacca 1440 

ggctcagacc acatgctggg accatcccaa gatgacagag ttataccaaa ccctagctga 1500 

tctgaacaac attaagttct cagcttatcg cactgccatg aaactccgca gagtccagaa 1560 

agccctgcgc ttggacctgg taactttaac cacagccctg gaaatcttca atgagcatga 1620 

tctgcaggcc agtgagcacg tgatggatgt ggtagaggtc attcactgcc tgactgeett 1680 

atatgaacgt ttggaggagg aaagaggcat cctggtcaat gtgccactct gtgtggacat 1740 

gagectcaat tggctcctca atgtttttga tagtggtcgc ageggaaaga tgegggcatt 1800 

gtcttttaag actggcattg catgcttgtg tggcacggaa gtgaaggaaa aacttcagta 1860 

cctcttcagc caagtggcca actcaggcag ccagtgtgac cagcgccacc ttggtgtcct 1920 

gcttcatgag gecattcagg tgccccgtca gctgggtgaa gtggcagcct ttgggggcag 1980 

caatgtggag cccagtgtcc gtagttgctt ccgttttagc acegggaage cagtcattga 2040 

agcatcccag ttcctggagt gggtcaacct ggagccccag tccatggtgt ggctgcctgt 2100 

tetgeategg gtaaccattg ctgagcaagt gaagcatcag accaagtget ctatctgtag 2160 

cgcagtgccc catcaagggg ttcaggtacc ggagtctgaa gcaattcaac gttgacatct 2220 

gccagacctg cttcttgaca ggcagggeca gcaaaggcaa ttaagctgea ctaccccatc 2280 

atggcagtat tacacaccga ccacatccag tgagaacatg agggactttg ccacaacctt 2340 

aaagaacaaa ttccgctcca agcattattt cagcaaacac cctcagcgag gttatctgee 2400 

tgtgcaatca gtgctggagg ctgactacag tgagacgccg gcttcttccc cgatgtggcc 2460 

acacgccgac acacactccc gaattgagca ttttgecage aggcttgetg agcatggaaa 2520 

gtcaaaattg ctccttcttt aatgaaaggt tgttccccaa ttgaaagctt aaacgaggac 2580 

cagtacctgc tgcggcactc cagccccatc acagaceggg agccagcctt tggacagcag 2640 

gctccatgca gtgtgggcac agaaagcaaa ggggagctac agaagatcct ggcccacttg 2700 

gaagatgaga aceggattet ccagggagag ctgaggcgcc tgaagtggca gcatgaggag 2760 

gcagctgagg cacccagtct ggctgaegge tccactgagg cagcaacaga ccaccgcaat 2820 

gaggagcttc tggecgagge ccgtatcctt eggcaacata agagccgcct ggagacgege 2880 

atgeagatec tcgaagatca caacaagcag ctagagtccc cagctgcagc gtcttgaggg 2940 



f 

ggsac 



PCTAJS01/06059 



ggggaacact gcccttcatc 180 
tgtggacatc gggctggacc 240 
ggctgctgga ggatcccacc 3 00 
gaatgccctc acccagaaaa 360 
ggatgtgtct cgggtcctca 420 
tgcctctcct ctgcacctcg 480 

488 



76/228 



BNSDOCID: <WO 0162927A2J_> 



WO 01/62927 



PCT/US01/06059 



agcttcttcc 
gcttcctctc 
agataccgag 
cggacatcat 
gcgcaacacc 
ctcctggttc 
ctggccccac 
ggtatgtgag 
cctagaaagt 
tgcagcccag 
cagaagctaa 
gaagcctaat 
actctctggc 
aagtggcccc 
gattgctagc 
tttagcctca 
ctaccgttat 
cggcacatgt 
cgagttgctc 
cagacctgag 
taaggaagag 
cgcttcccaa 
atttgtcccc 
ataggagcca 
aaggccttcc 
actgaactag 
aatagatagt 
aggcagcttt 
ctaagtttat 
cctcactcac 
gttagcccca 
ctctccattg 
tcacagagaa 
ttgagacctg 
aatagagcaa 
agcctttgcc 
tcactc 



tgcagccacc 
cacagcagtc 
gctgcatgat 
ggagaaactc 
ctgctggcct 
cgggtcaaag 
attcctcaac 
gtgcatgcgg 
gttcccttta 
ggcaaatatc 
gcggtacatg 
taccgccaag 
gcaactcagg 
tgaaaaagtc 
catcctaatg 
gtccctgaca 
atttggaagc 
gtcctatttc 
tggaaacctg 
gacacagtgg 
gaagcccacc 
ggctgtgacc 
agctaafccat 
caggccacaa 
tgtgcctaaa 
ctcttcctag 
aggggtgggg 
ctcccaagag 
atacatacat 
acattaaatt 
gcctctggtt 
atgtttggcg 
tctaacagga 
ggccttgggt 
acccaagctc 
agctaagaca 



caccgaatca 
agaaggccag 
gatgtggggt 
cgtcatgcct 
cttgatggag 
cctttcctca 
tagtattatt 
gcagtgatgg 
atcttcaagt 
acttggccat 
ttggcccctc 
gttccatcca 
gaggcaaaag 
tgggcagtgt 
ctacctaggt 
gaccttcctt 
attaaatcca 
gttaaagtcc 
gctgaagcca 
ccagcttgcg 
tcaagtcttg 
cctttctagc 
aaggcatttg 
atcaggaagc 
attgttattt 
gcgacatata 
gagtgggctt 
gagctttata 
atctattcta 
ctaatacctc 
ggaaagatag 
gtgggctgtc 
agagtcactg 
gaattttcct 
tgcccacccc 
cagagaacat 



gatggcagtg 
tcacccccgg 
caaagagcca 
tcccccagtg 
ccagatcccc 
aggcttcacc 
tgggcatctg 
gagaagggga 
tcgagatcag 
ttcagatggg 
ttccttctcc 
gcattatgaa 
ggtatccccc 
gtgtactatt 
ataggtttct 
aagggggaga 
tacttatggg 
tcagcatttt 
cccttgtgct 
ttgagcaagg 
gaagtagtta 
ttctcaggct 
caactggcta 
gtggccactt 
ctaactctgg 
ccatgatctc 
gggaatgttg 
acctagagcc 
catatctact 
ttgaagtatt 
atgtatcctg 
ctcaagacca 
ttctcttcac 
gggctcccca 
atgcagcaaa 
cctctggctc 



gctctgcagg 
gagaagggac 
ggatgtcagc 
tgcgaagttc 
atcctatagt 
caacctttcc 
ggcagcagca 
gcgcatgatt 
ccctttaagt 
gagcagagaa 
gtgggaatta 
tccaacaggg 
gagacacctt 
aacttcagct 
cc.tcaaatat 
acagttcatt 
tttagaatgt 
gagaggagcc 
cctctgagtc 
gagtgtggct 
tgacctgcca 
gaatcaggtg 
ccgtgtgtag 
tccagccaga 
a&ggtgaatc 
tttatccaga 
aatagattca 
ctgcagttga 
gtacatcagt 
aatgttctgc 
gctgctcccc 
aagcaacagg 
tcattccaaa 
taaatggttc 
aggacaaaca 
ccttccagac 



ctcgtcccta 
agactactcc 
ctgtgcttgg 
tgatgtgact 
tcatagtcct 
cagtttccac 
gggatctggt 
ctttcttgac 
acctttctgt 
gcctgccttg 
tggaaagaga 
ttttacccct 
ccccaccccc 
ccagggacag 
cttcattttc 
ctgggttctt 
ttgtttttgc 
cactgagttg 
tgtgcctcag 
ttcctgtggg 
tgcccacaac 
aaggtggctg 
tggcccctcc 
caacaacagc 
caatcctcaa 
aactgttcag 
gagcccactt 
cttcatttgc 
gctctcctgg 
cttgctcttt 
taggatatct 
aaacctgtgg 
tcttgttccc 
aggtgccaac 
ggccttcatt 
tcttatggtc 



3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5106 



<210> 158 

<211> 3800 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 
<223> Incyte ID No: 



LI : 410742 . 1 : 200 0MAY01 



<220> 

<221> unsure 
<222> 3787 
<223> a, t, c, 



g, or other 



<400> 158 

agaccatctg 

attaactgca 

aaatcagaga 

gtggtcatat 

gagatgacta 

ccctgcagag 

tacccaacca 

ttttggaatc 

ccgggaggaa 

tgtagagaaa 

attttcacaa 

gatgatacta 

caaaattgtc 

ggaagtccgt 



tagactcctt 
gacggctgtc 
caatgatggg 
cggttcatgg 
tgcaacaagc 
aaggagaaga 
acccagaaaa 
tttgttgttg 
actccaagct 
ccacttatac 
caaattttca 
aatgccacag 
tctcaggaac 
actttgacca 



cggaaagcag 
aggatagaag 
gctcttcccc 
agaattgcga 
taaaagaagg 
taactcaaaa 
tcacctaccg 
acaaaaacac 
tcctgatcac 
taacggttaa 
tgggtgaaat 
atgcagatga 
cagcaggcac 
attctcttga 



cagagatgct 
cagcggctca 
agaactacag 
atagagacta 
caaaaacgtg 
agaaacccaa 
aatctctgga 
tggagatatt 
atgtcgggct 
aattttggat 
tgaagaaaat 
accaaaccac 
acccatgttc 
^ccgagagcaa 



gcagagggtt 
cttggacttt 
gggctctggc 
aaggtcaata 
aatgggtgaa 
ttgccaagat 
gtgggaatcg 
aacataacag 
ctaaatgcoc 
attaatgata 
agtgcctcaa 
ttgaattcta 
ctcctaagca 
gctagcagct 



tttcttagac 60 
ttcaccaggg 120 
catcttcgtg 180 
tgatgaagaa 240 
atttgccaaa 300 
tacttcagga 360 
atcagccgcc 420 
ctatagtcga 480 
aaggactaga 540 
atcctccagt 600 
acrbcactggt 660 
aaattgcctt 720 
gaaacactgg 780 
atcgtctggt 840 



77/228 



BNSDOCID: <WO_01 62927 A2J_> 



WO 01/62927 



f 

igaTaa 



PCT/US01/06059 



tgtgagtggt gcagaTTaaag 
agtgaaagat gtcaacgata 
tgaagaaaat attttaagtt 
gtacacagat aattggcttg 
tgaaatacaa acgtgatcct 
attatgaaca actacaaagc 
accaatcagt tatctctcga 
atgtaagaga aggaattgca 
taagtagcaa aaaattggtg 
ctaacaaagc tgcctcaaat 
tgattgattc aaaaactgct 
tcatagttaa caaaacaatc 
cttctacagg cacggtatat 
tcctcgaaaa agatgcagtt 



tgaataatag 
ctgccgtatg 
agatacctcc 
gtgagatgcc 
gaacttctta 
tggggcctgc 
ttctgctgtt 
tcccagttcc 



atacactggc 
gagtatcaca 

tggagtatac 

acgcagcttg 
cccaaccaca 
cgccatcggc 
gacctgtgac 
tgatggctca 



ctgaagacaa ggaaatcaca 
tcatggaaag ttctgaagtt 
cttcagggaa tggaaatgac 
ggctttgcaa cagggacagt 
ggcatttggt ccctcaggcc 



caataaggac 
gaaagcattt 
ctatgataat 
gttttattgc 
aaacttgccc 
agttcagccc 
catcccatat 
ctaagggagc 
ttcccttaac 
tcttggttcc 
gagaatgtga 



tacgctgatg 
gcctgtgcgg 
gaaggcgcag 
tgatgacctg 
agagataaag 
ccacccccct 
aaaagtccag 
ttcgtgtttg 
cttctggagc 
ctcggcgcaa 
aaagtgccag 



atggagaagg 
acttcccaat 
ctgaattact 
cagtatattt 
agaactaatg 
gtgaaactta 
taccgagttc 
ttccgtcctg 
gattatatcc 
gtcaaatatg 
gaaatcaaat 
acagctgagg 
gttagagtac 
tgcagttctt 
ccctatacat 
accctcaatg 
cacatctccc 
acactggaag 
agccctggga 
ctgctgctcc 
tgtggggcag 
gaaggaacaa 
aatatttgtg 
tgtacaaata 
cactaagctt 
gtcaggagct 
agtctggaac 
gggcgataag 
aggaagacga 
atgccactgg 
gatgacagct 
ccccttttgg 
tgctaaagac 
cagacaggat 
gtccgcctct 
atggtaacta 
ccgttccagc 
aaaaagggtg 
aacgcagcta 
aatatgaacc 
tgggagctaa 
tttggccact 
ggttcatacc 
caaacttgtt 
cgcgaaattg 
gaaggtgttt 
gtagcgctgg 
ggtaaggaat 
cttgtgggag 



actatcaact 
gtttagagac 
tcgatttcaa 
ctttacctct 
,aaggcatcct 
gtattgctgt 
agtcaacccc 
cttccaagac 
tgggaacata 
tcatgggacg 
ttgtcaaaaa 
ttctggccat 
ccgatttcaa 
caccttccgt 
ttgcactgga 
ctacctcggc 
tggtacttac 
tctgtcagtg 
ccaggtatgg 
ttggtctcct 
gttctactgg 
ttcatcagtg 
tgcctcctgt 
cgtatgccag 
ggagcagcca 
gtttcaggat 
catgagaaca 
catgaatttt 
tggccaggaa 
tgggtgctgt 
tcttgggact 
ttgtttttga 
cagcggttat 
ttggtaagtt 
gggtcctgtc 
ctttagttaa 
tgcaggcttt 
actctgtcgc 
cgaggcgtca 
cagaatgagc 
agtattcaaa 
tattagcttc 
cgcaaaagcc 
agtaaatcct 
ttccgtaaac 
taatcacttt 
gttcattgtc 
caacaaaggg 
gccagaggca 



caatgcgaat 
tctcagtatt 
gtaacagatt 
gggaatgaag 
gaaagtggtg 
caaaaacaaa 
agtcacaatt 
atttactgtg 
tcaagccatc 
taacgatggt 
tatgaaccga 
agatgaatac 
tgacaattgt 
ggttgtctcc 
agate aacct 
cctcctcaga 
agacagtcag 
tgacaacagg 
caggccgcac 
getgetgetg 
gggagtgaca 
gggaattgaa 
aacagecaat 
aggcacagcg 
ctgaatctgg 
teggagcage 
aggcattcca 
ctggactcct 
gcaaatgact 
gggctccgtg 
cacttggacc 
aattgggttg 
gggattgaga 
gecaaacttt 
cagccaagct 
egggagattt 
tgatcccact 
cagtttccca 
cattactatg 
gtggagatat 
catatgeata 
tctcataaaa 
ccataatgtt 
taaaggtttt 
agttcttaag 
taaaaatgea 
gtccaattgc 
gcccgggcgc 
aggtggatct 



gtaatattaa 
cagcaegtat 
tggatgaaga 
gaaattggtt 
aaggctctag 
gctgaatttc 
caggtaataa 
caaaaaggca 
gatgaggaca 
ggatacctaa 
gattctactt 
aegggtaaaa 
ccaacagctg 
gctagaacac 
gtaaagttgc 
gcccaggaac 
aacaateggt 
ggcatctgtg 
tcagggaggc 
ttggcccccc 
ggtggtttta 
ggagcccatc 
ggagecgatt 
gtggaaggca 
aggtgetgea 
cactggagtt 
ctggaggaac 
acttttctca 
gcttgtggat 
ggttgttgca 
caaatttaaa 
aaggcaaaga 
tcctgtgggg 
gtcaggaagt 
ggtttcccaa 
atctccggct 
gtctccacac 
gtgtctccgt 
ctccggtaca 
ccacactgga 
gcaaacgctc 
tggattcacg 
gtcactccct 
tcacaaaccc 
cccttctcat 
aatagtgaat 
ctgtttctgt 
cggtggctca 
atgaggtcta 



900 
960 
1020 
1080 • 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3800 



ggccaaccta agctgggccc 
gaggatcctt gctcccgtct 
gecaaattet ggggatcttg 
acgtgtattt gggctaataa 
attataaatt aaatgttctg 
aattctcaca ggtacttatt 
ctcaaatcta tattccgcca 
ttttcccccc tgccccaacg 
tttagtggga gttaaggcag 
atatgeagtt gatacatgat 
cagccatgta atcgccagca 
ggagatnega gaccatcctg 

<210> 159 

<211> 2281 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 406568 . 1 : 200 0MAY19 
<220> 

<221> unsure 

<222> 446, 875, 1361 

<223> a, t, c, g, or other 

<400> 159 

cagcctgcca cttgcctccc tgcctgcttc tggctgeett gaatgcctgg tccttcaagc 60 
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tccttctggg tctgacaaag cagggaccat gtctaccttt ggctaccgaa gaggactcag 120 
taaatacgaa tccatcgacg gaggatgaac tcctcgcctc cctgtcagcc gaggagctga 180 
aggagctaga gagagagttg gaagacattg aacctgaccg caaccttccc gtggggctaa 240 
ggcaaaagag cctgacagag aaaaccccca cagggacatt cagcagagag. gcactgatgg 3 00 
cctattggga aaaggagtcc caaaaactct tggagaagga gaggctgggg gaatgtggaa 360 
aggttgcaga agacaaagag gaaagtgagg aagagcttat ctttactgaa agtaacagtg 420 
aggtttctga ggaagtgtat acagangagg aggaggagga gtcccaggag gaagaggagg 480 
aagaagacag tgacgaagag gaaagaacaa ttgaaactgc aaaagggatt aatggaactg 540 
taaattatga tagtgtcaat tctgacaact ctaagccaaa gatatttaaa agtcaaatag 600 
agaacataaa tttgaccaat ggcagcaatg ggaggaacac agagtcccca gctgccattc 660 
acccttgtgg aaatcctaca gtgattgagg acgctttgga caagattaaa agcaatgacc 720 
ctgacaccac agaagtcaat ttgaacaaca ttgagaacat cacaacacag acccttaccc 780 
gctttgctga agccctcaag gacaacactg tggtgaagac gttcagtctg gccaacacgc 840 
atgccgacga cagtgcagcc atggccattg caganatgct caaagtcaat gagcacatca 900 
ccaacgtaaa cgtcgagtcc aacttcataa cgggaaaggg gatcctggcc atcatgagag ,960 
ctctccagca caacacggtg ctcacggagc tgcgtttcca taaccagagg cacatcatgg 1020 
gcagccaggt ggaaatggag attgtcaagc tgctgaagga gaacacgacg ctgctgaggc 1080 
tgggatacca ttfctgaactc ccaggaccaa gaatgagcat gacgagcatt ttgacaagaa 1140 
atatggataa acagaggcaa aaacgtttgc aggagcaaaa acagcaggag ggatacgatg 1200 
gaggacccaa tcttaggacc aaagtctggc aaagaggaac acctagctct tcaccttatg 1260 
tatctcccag gcactcaccc tggtcatccc caaaactccc caaaaaagtc cagactgtga 1320 
ggagccgtcc tctgtctcct gtggccacac ctcctcctcc ncgggactcc tccactccca 13 80 
gagaaaaagc tcattaccag aaacattgca gaagtcatca aacaacagga gagtgcccaa 1440 
cgggcattac aaaatggaca aaaaaagaaa aaagggaaaa aggtcaagaa acagccaaac 1500 
agtattctaa aggaaataaa aaattctctg aggtcagtgc aagagaagaa aatggaagac 1560 
agttcccgac cttctacccc acagagatca gctcatgaga atctcatgga agcaattcgg 1620 
ggaagcagca taaaacagct aaagcgggtg gaagttccag aagccctgcg ataaaaacat 1680 
gatctttaga agaggatgca gaactgttca gtggtattac atgaaatgca ttgtgagatg 1740 
tttctaaaat accttcttca attcaaaatg atccctgact ttaaaaataa tctcacccat 1800 
taattccaaa gagaatctta agaaacaatc agcatgtttc ttctgtaaat atgaaaataa 1860 
atttcttttt tatgtcgtga gatttgtatt ggcaagaagc agttaattta aagatgctct 1920 
tcctatctgt ggatgtgttg gtaactccga gttgtaatga gttcatgaaa tgtgctgtta 1980 
tttttgtaat ctcaataaat gtggattgaa gttttttccc tttttttaaa gccaaactaa 2040 
tatttttctg tgacttgata catctgtcag atttttgtaa tctcgataaa tgtgtattga 2100 
agttttttcc ctttttttaa aaagccaaac taatattttt ctgtgagtta atacatctgt 2160" 
caggtgtgta tgtaacatta ctggacatta aaaaaaatta ttacattctc acccaaaaag 2220 
ggtttgggtc ctaagcattt gcctttcttt gtttcctctt gttcaagaaa atctgattag 2280 
a ~ ' 2281 

<210> 160 

<211> 5408 

<212> DMA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> incyte ID No: LI : 283762 . 1 : 2000MAY01 
<220> 

<221> unsure 
<222> 4004 

<223> a, t, c, g, or other 
<400> 160 

gcgagcgcgg tcgagatcgc catgcctacc cgagtatgct gctgctgttc cgctttgcgt 60 

cctcgctaca aacgcctggt ggacaacata ttccctgaag atccaaaagt aatttgatct 120 

acatctactg atccttctct ttgctgaccc attctgcccc ccgctgacct ccttaaggat 180 

ggccttgtga aaactgatat ggagaaattg acattttatg cagtatctgc tccagagaaa 240 

ctggatcgaa ttggttctta cctggcagaa aggttgagca gggatgttgt cagacatcgt 3 00 

tctgggtatg ttttgattgc tatggaggca ctggaccaac ttctcatggc ttgccattct 3 60 

caaagcatta agccatttgt agaaagcttt cttcatatgg tggcaaagct gctggaat-cg 420 

ggggaaccaa agcttcaagt tcttggaaca aattctttgg tcaaatttgc aaatattgaa^ 480 

gaagacacac catcctactc acagacgtta tgactttttt gtgtctcgat tcagtgccat 540 

gtgccattcc tgtcatagtg atccagaaat acgaacagag atacgaattg ctggaattag 600 

aggtattcaa ggtgtggttc gcaaaacagt caacgatgaa cttcgggcca ccatttggga 660 

acctcagcat atggataaga ttgttccatc cctcctgttt aacatgcaaa agatagaaga 720 

agttgacagt cgcataggcc ctccttcttc tccttctgca actgacaaag aagagaatcc 7S0 
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tgctgtgctg gctgWKact gtttcagaga actgctgggt cgaglaactt ttgggaatat 840 
gaataatgct gttagaccag tttttgcgca tttagatcat cacaaactgt gggatcccaa 900 
tgaatttgca gttcactgct ttaaaattat aatgtattcc attcaggctc agtattctca 960 
ccatgtgatc caggagattc taggacacct tgatgctcgt aaaaaagatg ctccccgggt 1020 
tcgagcaggt attattcagg ttctgttaga ggctgttgcc attgctgcta aaggttccat 1080 
aggtccgaca gtgctggaag tcttcaatac 'ccttttgaaa catctgcgtc tcagcgttga, 1140 
attcgaaagc aaatgattac agggggggat ctgtagccag tgtcaattta aattacaagt 1200 
tccaaagaca atgatgagaa gattgtggca gaatgctatc atccaaacaa taggattttt 1260 
tggaagtaac ctaccagatt atcagaggtc agaaatcatg atgttcatta tggggaaagt 1320 
acctgtcttt ggaacatcta cccatacttt ggatatcagt caactagggg atttgggaac 1380 
caggagaatt cagataatgt tgctgagatc tttgcttatg gtgacctctg gatataaagc 1440 
gaagacgatt gttactgcac tgccagggtc tttcctggat cctttggtta tcaccatctc 1500 
tcatggagga ctacgaactg agacagttgg tcttggaagt aatgacataa tatcatggat 1560 
cgtcatgaca atagggaaaa gcttcgaggg ataagaataa taacggatgt agctgaccta 1620 
aagataaaaa gagaaaaaat ttgcagacaa gacacaagtt tcatgaaaaa gaatgggcaa 1680 
cagctgtatc gggcacatat atttgggttg taaagaggaa gacaacgttc agaaaaacta 1740 
tgaactactt tatacttctc ttgctcttat aactattgaa ctgggctaat gaagaagtag 1800 
ttattgatct cattcgacct ggccattgct tttacaggac agtgcaatta tcaatgagga 1860 
taatttgcca atgttccatc gttgtggaat catggcactg gttgcagcat acctcaactt 1920 
tgtaagtcag atgatagctg tccctgcatt ttgccagcat gttagcaagg ttattgaaat 1980 
tcgaactatg gaagcccctt attttctacc agagcatatc ttcagagata agtgcatgct 2040 
tccaaaatct ttagaggaag catgaaaaag atttgtactt tctgaccaac aagattgcag 2100 
agtcgctagg tggaagtgga tatagtgttg agagattgtc agttccgtat gtaccacaag 2160 
taacagatga agatcgactt tctagaagaa aaagcattgt ggacaccgta tccattcagg 2220 
tggatatttt atccaacaat gttccttctg atgatgtggt tagtaacact gaagaaatca 2280 
cttttgaagc attggagaaa gccattggat accagtggaa tggaagaaca ggaaaaggaa 2340 
aagaggcgtc ttgtgataga gaaatttcag aaagcacctt ttgaagaaat agcagcacag 2400 
tgtgaatcca aagcaaattt gcttcatgat agacttgccc caaatattgg aactcaccat 2460 
acgtcctcct cccagtccat caggaacact gaccattact tctgggcatg cccaatacca 2520 
atctgtccca gtctatgaga tgaagtttcc agatctgtgt gtgtactgat cggcgcatga 2580 
agacctcagg atatgatttg taaagcctaa aaattaaggc caagctgagc . tttcagggtt 2640 
tacttaatgt gtattaacat acttcttgaa aataatgatg gaacatatct ttaaccaaat 2700 
gtttggcata ccatattaga agttttggag ctatatataa tttcgagtac tttcaaagat 2760 
agatttatgc catgttaatt tgctttgagg ttcctgttgc ctttttaagt tgaacatgtt 2820 
ttggtttcac tttattccac tgttaagtag tatgttttaa acttttcaca aaatgtaatg 2880 
ttttttaaaa agtaagcctt cagaggattg aaactggatt aattggttat ttcttaaaca 2940 
tttaaacagg cggtttagat gtagaatttt tgttgttgtt ttctgcaaag gcagatacat 3000 
ttaaatatat tcctagtcct gggggctgca aaactgttcg gtgggctttt tgtccccagt 3060 
gcgttagata agctggcgat aggcaccttg ctattcagtt actataataa tatgtgatag 3120 
gcattcctca tcttcttcac aataatagga catctgttga atagcattcc tcgaatataa 3180 
ccctaaaaac gccatacttt aaattgtctg gttcttgtaa tattgtgttt cctgccacag 3240 
agtttggacc attctgcttg agaagtgtag cagcttactc ttggagtacc aaactgtgca 3300 
atatttttta catcataaga tgtatacagt ttacaggctg tgcttcgaaa ttcatagtag 33 60 
tattttgctg tggctccatt aattaaatga gatatcatat ttagtgcaga aaacagacat 3420 
ttacaacagc tattcaagtt cacctgtgaa ggagtctgta cattttgatt tctattaaga 3480 
gcacatttat ttacatttgt atattatttg aaatcctcat actcaaaaaa ttctccagat 3540 
gggactttta atttgtaagg atttgaactg tgacttgtat acatctgttt aggaatcaat 3600 
tatatttgaa aagctgcctg tgttttaaca gtcaagtgtg ctaaagtttg tcaatttaag 3660 
gctggctttt gat tt cage t accaagatca caggtgeact ctacacataa cactggacag 3720 
gacccataac attacataca tcttaggtgg aattctatca catgggtaaa atggaacggc 3780 
tttctttgta actcataaaa ttctcttagg acatttttat aaagtcacct gtttatagtt 3840 
ctatcttttc agattccatt tctttttaca taaaacagca tacatatgea aaaactgtaa 3900 
ccttagaatc cagtttaatt tttgggcttt gtttttggtt taaaaaaatt gcagtgaaga 3960 
atggggatgt ttgtgtttat gggctatttg gggcaccttt tagntaggaa acagacaaaa 4020 
gtaaggacaa cgataatagg acaaggcata cttgaaaatt tctggaatac ttaaacaaaa 4080 
gcgcaacatc ttggaaaacc agtctagtca ttgaaaccta tgaaatgaca ctgaaagatc 4140 
ctggggctgc ttcttaaata agtagatcca gcaagacttg tttccagagt gaacagtggg 4200 
agtcgttacc gctggaggac taaagggece tgtgggcagc tgtcactgga actttgcctc 4260 
ttgatcagga aaaatgette accagtccgt aaagccaagt tgtatttttt ttattgeect 4320 
ttttttcctt ctgtattttt aaagaaggat gttaattttt gacctatata ttttaaaaaa 4380 
atctaagcag ggggacatgc aaaaacaatc atccacccac ttgggatgtc atttttatag 4440 
attaacactg tgtgcttttg tatggaaaaa atatatataa tttaatagta taaaacataa 4500 
catatatatt catttgeact tacgtgaaac acaaactttg ctctacaaaa tttcgtgttt 4560 
cttaggtgat tttaaaaatg catgtattgc atgtaaagga aaacccatta ccaattaatg 4620 
tttaatccac acctttatcc ttggtctttg ttggatttgg ggttttgttg ggggtttttg 4680 
ttactgtttt taaattacag taggcttcct atatacccct ggatttctga actggtcttg 4740 
ttgacaagga ttcccaagaa atggatctta tcaactggct gactctccca tatctgeaag 4800 
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acccattctg caggaactgg gtcgtgcaca cggtgttgta gccagttcag gtactgaata 4860 
tttaggattt ggtgaagttc actgtattgc tatatttttg taagataaat aaaactcaat 4920 
taaattgtta atccattctc ttttttgctg tatagaattg cttatatcac tctttctttc 4980 ' 
atgacattgg ttaacattta aatgttcctc ctgtacttgt gttgtcctgt gacccgcctt 5040 
atagagtttt attgttattg gtgtttaccc tgaataccct atgccgtaca cacacacata' 5100 
tttccttaat acttctgaac tcattatctt ttagaataat aatacctacc tacactttac 5160 
ccagcaatta acttcctccc ctaccccaaa atgtttttcc ttccctgtct gaaaatggaa 5220 
ctaatttgtc ttattcgtgc ttatatctgt attaaaatgc aataaaggtt agttttttga 5280 
aatgtaaaat ttccactgtg caatttcata tgtcagcaaa taaaccatag attatttgta 5340 
tataagttca tttttatagc tattgttaca tttaattaaa tttattcaat atttgcttaa 5400 
attaaaaa 5408 

<210> 161 

<211> 2895 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: LI : 347687 . 113 : 2000MAY01 
<400> 161 

agccaacctt acttacagca taagggagcg tagaatctgt gtagacgaag ggggcatctg 60 
gccttacacc tcgttaggga agagaaacag ggtgttgtca gcatcttctc actcccttct 120 
ccttgataac agctaccatg acaaccctgt ggtttccaag gagctgagaa tagaaggaaa 180 
ctagcttaca tgagaacaga ctggcctgag gagcagcagt tgctggtggc actgttctgt 240 
ggctgtggac atgaagccct cactggcaca gaaaagctaa ttgagaccta tttctccaaa 300 
aactaccaag actatgagta tctcatcaat gtgatccatg ccttccagta tgtcatctat 3 60 ; 
ggaactgcct ctttcttctt cctttatggg gccctcctgc tggctgaggg cttctacacc 420 
accggcgcag tcaggcagat ctttggcgac tacaagacca ccatctgcgg caagggcctg 480 
agcgcaacgt ttgtgggcat cacctatgcc ctgaccgttg tgtggctcct ggtgtttgcc 540 
tgctctgctg tgcctgtgta catttacttc aacacctgga ccacctgcca gtctattgcc 600 
ttccccagca agacctctgc cagtataggc agtctctgtg ctgacgccag aatgtatggt 660 
gttctcccat ggaatgcttt ccctggcaag gtttgtggct ccaaccttct gtccatctgc 720 
aaaacagctg agttccaaat gaccttccac ctgtttattg ctgcatttgt gggggctgca 780 
gctacactgg tttccctgct caccttcatg attgctgcca cttacaactt tgccgtcctt 840 
aaactcatgg gccgaggcac caagttctga tcccccgtag aaatccccct ttctctaata 900 
gcgaggctct aaccacacag cctacaatgcf tgcgtctccc atcttaactc tttgccttgc 960 
caccaactgg ccctcttctt acttgatgag tgtaacaaga aaggagagtc ttgcagtgat 1020 
taaggtctct ctttggactc tcccctctta tgtacctctt ttagtcattt gcttcatagc 1080 - 
tggttcctgc tagaaatggg aaatgcctaa gaagatgact tcccaactgc aagtcacaaa 1140 
ggaatggagg ctctaattga attttcaagc atctcctgag gatcagaaag taatttcttc 1200 
tcaaagggta cttccactga tggaaacaaa gtggaaggaa agatgctcag gtacagagaa 1260 
ggaatgtctt tggtcctctt gccatctata ggggccaaat atattctctt tggtgtacaa 1320 
aatggaattc attctggtct ctctattacc actgaagata gaagaaaaaa gaatgtcaga 1380 
aaaacaataa gagcgtttgc ccaaatctgc ctattgcagc tgggagaagg gggtcaaagc 1440 
aaggatcttt cacccacaga aagagagcac tgaccccgat gggcgatgga ctactgaagc 1500 
cctaactcag ccaaccttac ttacagcata agggagcgta gaatctgtgt agacgaaggg 1560 
ggcatctggc cttacacctc gttagggaag agaaacaggg tgttgtcagc atcttctcac 1620 
tcccttctcc ttgataacag ctaccatgac aaccctgtgg tttccaagga gctgagaata 1680 
gaaggaaact agcttacatg agaacagact ggcctgagga gcagcagttg ctggtggcta 1740 
atggtgtaac ctgagatggc cctctggtag acacaggata gataactctt tggatagcat 1800 
gtcttttttc tgttaattag ttgtgtactc tggcctctgt catatcttca caatggtgct 1860 
catttcatgg ggtattatcc attcagtcat cgtaggtgat ttgaaggtct tgatttgttt 1920 
tagaatgatg cacatttcat gtattccagt ttgtttatta cttatttggg gttgcatcag 1980 
aaatgtctgg agaataattc tttgattatg actgtttttt aaactaggaa aattggacat 2040 
taagcatcac aaatgatatt aaaaattggc tagttgaatc tattgggatt ttctacaagt 2100 
attctgcctt tgcagaaaca gatttggtga atttgaatct caatttgagt aatctgatcg 2160 
ttctttctag ctaatggaaa atgattttac ttagcaatgt tatcttggtg tgttaagagt 2220 
taggtttaac ataaaggtta ttttctcctg atatagatca cataacagaa tgcaccagtc 2280 
atcagctatt cagttggtaa gcttccagga aaaaggacag gcagaaagag tttgagacct 2340 
gaatagctcc cagatttcag tcttttcctg tttttgttaa ctttgggtta aaaaaaaaaa 2400 
aaagtctgat tggttttaat tgaaggaaag atttgtacta cagttctttt gttgtaaaga 2460 
gttgtgttgt tcttttcccc caaagtggtt tcagcaatat ttaaggagat gtaagagctt 2520 
tacaaaaaga cacttgatac ttgttttcaa accagtatac aagataagct tctatgcagc 2580 
ctggaaggag gagagggaaa atgttttgta agaaaccaat caagataaag gacagtgaag 2640 
taatccgtac cttgtgtttt gttttgattt aataacataa caaataacca acccttccct 2700 
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gaaaacctca catgcSrcaca tacacatata tacacacaca aagagagtta atcaactgaa 2760 
agtgtttcct tcatttctga tatagaattg caattttaac acacataaag gataaacttt 2820 
tagaaactta tcttacaaag tgtattttat aaaattaaag aaaataaaat taagaatgtt 2880 



<210> 162 

<211> 561 

<212> DNA ' 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 1146510 . 1 :2000MAY01 

<400> 162 , 

tgcataaaaa gcagagcgta gcaagtaagc tgcaaatctg aagaaaatcc atgtgtccac 60 

cagctgccat ctccagaact cttactccag gaaatccgaa agagtgaaca ttcttttagt 120 

ctcctactcc tcaattaagt aaatgagaat gagtcagcca acaaagttca tgacaacaag 180 

gtgcaggatg gtgctggcaa agagaaaatc agcaaaggct cgctctgggg agatgccttg 240 

gaaatccgct ttgttctctg ggttgactct gtaattacat cacggccata ccgctaggat 300 

gaactcccca cacaagagat gaagcccgag agaaaagtag ttgaacgagc ggaatggtcc 360 

ccagcgacgg agacagtgaa ccgaactgca gcgccccagg tcagccagta tatacagcag 420 

gtacgcgtcc agcaacttca gacgctgcgg agtggagctc aagtactctt ctaagaaccg 480 

cgaaatgaca gacactaccg acgccgtaca taactgcaac gcaaggtact ccagtccgcg 540 

ccccaaactc ttggaggacc c 561 

<210> 163 , 

<211> 630 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG:451710 . 1 : 2000FEB18 
<400> 163 

gttcgtccgc tagggtttta gcccggtacc ccgaatcccg ctcctccgcc gcctcgcgtc 60 
cacccttcct cttggcg;cgc tccgatccaa gagatggcgg tgccgctgct gacaaagaag 120 
atcgtgaaga agcgggtcaa gcagttcaag aggccccatc tcgaccgcta taagtgcctt 180 
aagccaagct ggcgcaggcc aaagggtatt gactcccgtg tcaggaggaa gttcaaggga 240 
tgcaccttga tgcccaacat tggttacggc tctgacaaat caaccaggca ctacctcccc 300 
aataagttca agaagtttgt ggtgcacaat gtctctgagc tggagttgct catgatgcac 360 
aacagaacct actgtgccga gatcgcccac aacgtttcaa ccaagaagcg caaggagatt 420 
gttgaacgtg ccgcgcagct cgacattgtg gtcaccaaca agctcgccag gctccgcagc 480 
caggaggatg agtagtctgg tttcttgtca taacggccaa ctagatcctg ccattcccgg 540 
ctgtaggaaa ctactctgct ttgtgccttg ttggtactta gaataccagt cattttctac 600 
atttttggta ccagtagcga cttctatgtg 630 

<210> 164 
<211> 570 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID NO: LG: 455771 . 1 :2000FEB18 
<400> 164 

ccgcttcttg cagccgtcgc cgcgaagcgc tccagagaac ttccgtcaac atggggaaga 60 
cacgtggtat gggagccggg cgcaagctca agacccaccg caggaaccag cggtgggctg 120 
acaaggcata caagaagagc catttgggca atgagtggaa gaaacccttc gctgggtcat 180 
cccatgccaa gggcattgtc ctggagaaga ttggtattga ggccaagcag cccaactccg 240 
ctatccgtaa gtgtgctcgt gttcagcttg ttaagaatgg caagaagatt gctgccttcg 300 
tgccaaatga cggttgtttg aactacattg aggaaaatga tgaggtcttg attgctggat 360 
ttggtcgtaa ggggcacgct gtgggagata ttcctggtgt ccggttcaag gtcgtcaagg 420 
tttccggtgt gtctctgctt gcccttttca aggagaagaa agagaagcca aggtcttaga 480 
ttgctcttgc taccaaaatc agcaagcgtg gagttgaaac gggagggcgt tagatgatta 540 



ctcaatcaaa aaaaa 



2895 



82/228 



BNSDOCID: <WO 0162927A2_L> 
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PCT/US01/06059 



agaagaatgg ttgcattgct atgtttgcag 



570 



<210> 165 
<211> 821 
<212> DNA 

<213> Homo' sapiens 



<220> 

<221> misc_f eature 
<223> Incyte ID No: 



LG : 452089 • 1 : 2000FEB18 



<400> 165 

ttacacgtcg 

agggtacggc 

ggagatcaag 

cacccgcgcc 

gaggtacacg 

ggtgaggggc 

gctgctcagg 

cgaccttggt 

gctggagcgt 

tcagcacagg 

catcctcaaa 

tctggtctct 

ttgggtttaa 

tgttgtcgca 



tgactgggaa 
cgtctcctcg 
gtgcagaagc 
gcaaaggtgc 
gtgcggtcgt 
gagaaggcca 
aggaacttca 
atcaagtacg 
gcgggctacc 
gtgaccaagg 
caaggctcag 
tctcctcctc 
tttgtgttct 
agatcaccac 



aacaccgtcg 
cccccatggc 
tcgtcctcaa 
tcgagcagct 
tcggcatccg 
tgcagctgct 
gcgacaccgg 
atccttcaac 
gtgtggcacg 
aggactccat 
gcctacacgt 
cgtgtcaacg 
cagacctgat 
catgtcaaat 



tccgccgccg 
ccacgagaag 
tatctccgtc 
cagcggccag 
gcgtaacgag 
tgagagcggc 
gtgctttggt 
aggcatctac 
ccgcaggagg 
gaagtggttc 
cccccccctc 
caagaccacc 
gatttagtct 
tagttccgtg 



ccgaaggacg 
aagctgtcca 
ggggagagcg 
acccccgtct 
aagatcgcct 
ctcaaggtca 
ttcggcatcc 
ggaatggact 
tgcaagtccc 
caggtcaagt 
acctgtgggc 
acccacctcg 
gctagcactc 



gaaggagaag 6 0 
acccgatgcg 120 
gcgaccgtct ,180 
tctccaaggc 240 
gctacgtcac 3 00 
aggagtacga 3 60 
aggagcacat 420 
tctacgtcgt 480 
gcgtcgggat 540 
acgaaggcgt 600 
aaaacagctc 660 
ccaggctttt 720 
tgtgggatgc 780 
821 



<210> 166 
<211> 503 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> mis cofeature 

<223> Incyte ID No: LG: 24 6415 . 1 :2000FEB18 



<400> 166 

tgcttcctcc 

aaaagggccg 

aggacaaggc 

acatttctga 

attactgtct 

acaaggactg 

caaagcccac 

aaataaaatg 

tgcaccatgt 



agtccgtgcc 
cggccacgtg 
tattaacaaa 
agcgagcgtc 
gagttgtgca 
aacaccccta 
gtaaggagtt 
gaaattgtac 
ggtcccagct 



tccaatatga 
cagcctattt 
ttcatcattg 
tttgatgcct 
attcacagca 
ctccaattta 
gagtccttaa 
cttaattaaa 
aca 



tgaaaaaaag 
gcgacacgaa 
gaaacacagt 
atgtgcttcc 
gagtagtcag 
gacctgcgga 
agtctgaaga 
aaatacaaaa 



aaggaacaac 
ctgtgcccaa 
ggaggctgca 
caagctgtat 
gaatcgatct 
tgctgcccca 
cggactattc 
agtagccatg 



ggtcgtacca 60 
tgtgtgccca 120 
gcagtcaggg 180 
ttgaagctac 240 
tgtgaagccc 300 
caacccccac 3 60 
tcattgagaa 420 
agtggtgaca 480 
503 



<210> 167 
<211> 753 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> mis cofeature 
<223> Incyte ID No: 



LG: 414144 . 10:20 00FEB18 



<400> 167 

ctgcaccact 

tctcctgaaa 

tttggcaaag 

cctacatgat 

acctcatgcc 

ggattgacgg 

gtcaagaata 

tttggcgtca 

caggagatct 

aatagagtgg 



agaaagatgg 
aagaggaagg 
aaggagcaga 
tcctggcggc 
ttggaattgc 
cgtgagttta 
agaccatccc 
tttgcttgga 
catggttctt 
gcttcctcaa 



cggagcaaga 
cttatcaagc 
agaaaggaaa 
agaaacgtga 
cagataaaca 
ctggtgcaga 
tctgacagac 
agacctcatt 
gtgccctttc 
ggagatgggc 



gcaaagaaaa 
cctcaaagcc 
agggctcagg 
caaggtgcgt 
ttccttgggc 
gaactcattt 
aatacagtga 
catgaaattg 
cacctctcag 
acacctggct 



atccctttgg 
acccaggcaa 
tttaagcgac 
ctcagacgac 
ctttgttgta 
tgaaacgtgg 
ttgaggagca 
ccttcccagg 
tggcccgtca 
atcggggtga 



ttccagaaaa 60 
agcaggcact 120 
tggaatcatt 180 
tagaagtgaa 240 
cgcatcgaaa 300 
acaagccaag 360 
cctggggaag 420 
gaagcatttc 480 
tgctaccaaa 540 
acgcabcaat 600 



83/228 



BNSDOCID: <WO 0 1 62927A2_I_> 
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:agccgaa ctagacccag gtgccaaact gcggtaa 



cagctcatcc gtcagctgaa ctagacccag gtgccaaact gcggtaaatt ttttatcagt 660 
gaagtggaag catgtgtttt gttttggaaa tttttatcaa gtatcttcag agaagattat 720 
ttcctgcttt atcttcagaa actggaaagg gtc 753 

<210> 168 

<211> 885 . . 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> IncytG ID No: LG: 1101445 . 1 :2000FEB18 
<220> 

<221> unsure 

<222> 754, 809, 855, 866 
<223> a, t, c, g, or other 

<400> 168 

ctggaaccat ggaggctgta ccagagaaga aaaagaaggt tgccgctgcg ccaggaaccc 60 
ttaagaagaa aaaggttcct gcggtgccag aaacccttaa gaaaaagcga aggaatttcg 120 
cagagttgaa ggtcaagcgc ctgaggaaga agtttgccct gaagacactg cgaaaggcaa 180 
ggaggaagct catctatgag aaggcaaagc actatcacaa ggagtacaga cagatgtacc 240 
ggactgagat tcgcatggct aggatggcga ggaaagctgg caacttctat gtgcccgcag 300 
aaccaaaatt ggcctttgtc atcagaatcc gaggtatcaa tggagtgagc ccaaaggtgc 360 
gcaaggtgct gcagctgctc cgtctccggc agatcttcaa tggcaccttt gtgaagctca 420 
acaaggcttc agtgaacatg ctgaggatcg tggagcccta cattgcatgg ggggtacccc 480 
aacctgaagt cagtaaacga gctcatctac aaacgaggct atggcaaaat caataaaaag 540 
cgcattgcct tgacagataa ctccttggtt gctcgatctc ttggtaaatt tggcatcatc 600 
tgcatggagg atctaattca tgagatctat acagttggaa aacgctttaa ggaagcaaat 660 
aacttcctgt ggcccttcaa actgtcttcc ccacgaggtg gaatgaagaa • aaagacaact 720 
cactttgtag aaggtggaga tgctggcaac aggngaggac cagatcaaac aggcttatta 780 
ggaaggaatg aacttaaggt ggtctaccna tggattattt ttctaagctg ggttgggtta 840 
ataaacagta cctgnctctc aaaatngaaa aaaaaaagaa taaaa 885 

<210> 169 
<211> 586 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 452134 . 1 : 2000FEB18 
<400> 169 

tacacccgcc gagatattga tgcctaagaa gaactgggat tcccatttat gaactccttt 60 
ttaaggagag agtcatggtg gccaagaagg atgtccacat gcctaagcac ctggagctgg 120 
cagacaagaa tgtgcccaac catcatgtca tgaaggccat gcagcctctc aagtcccgag 180 
gctaagtgaa ggtacagttt gcctggagac atctctactg ctaccttacc aatgagggca 240 
tccagtatct ccgtgattac cttcatatgc cccggagatt gtgcctgcca ccctacgccg 300 
cagccattct gagactggca ggcctcagcc taaaggtctg aagggtgagc gacctgcaag 3 60 
actcacaaga ggggaagccg acagagatac ctacaggcag attgctgtgc cccctgatgc 420 
cgacaggaag gctgaggctg aggctggggc tgggtcagag acggaattcc agtttagagg 480 
cagatttggt tgtggaggtg gtcagccacc tcagtaaagt tggagaggat tattttgcat 540 
taaataaact tacagccaaa aaaaatttaa aaaaaaatct cattaa 586 

<210> 170 
<211> 659 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LI : 903021 . 1 :2000FEB01 
<400> 170 

ctcagttgtg agggggaagg tagactcttt ccagaccttt gataaagggg ctgctcaaaa 60 

84/228 
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aagccacgac 
taaaaaacct 
acccaagtgg 
gagaagaaga 
gagctgaaga 
aggaagctta 
actgaaattc 
cccaaattgg 
aaaggtgttg 
cagggcttca 



cttgggcctc 
tatccctctt 
aagtcagcag 
aggttcctgc 
tcaagcgcct 
tctatgaa^a 
aaatatcga,g 
cgtttgtcat 
caacttcttt 
ataatgtgct 



) 

tgctaaaaccr 
tctaatctct 
gaaagtagct 
tgtgccagaa 
gagaaataag 
agcaaagcac 
gatagcaaga 
caggatcgga 
gccttcatca 
gaggtttgta 



atggckt.taa, 
tatgtcctgt 
gggatgatgc 
acccttaaga 
tttgcccaaa 
tatcacaagg 
aaagctggca 
ggttatcaat 
aatcttcaat 
gaaccatata 



gtaaggaggg 
gagtgtcacc 
agtctttaca 
aaaagtgaag 
agatgcttct 
aatatatgca 
acttctatgt 
tgggtgagcc 
agaaactttg 
ttgcatgggc 



PCT/US01/06059 

agcagatata 120 
cattgagtaa 180 
gggtgttgaa 240 
gaatttcaca 300 
aaaggcaagg 3 60 
gatgtacaga 420 
atctgcagaa 480 
caaaggtctg 540 
tgaagctcag 600 
atacccaaa 659 



<210> 171 
<211> 443 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LI : 246422 . 1 :2000FEB01 



<400> 171 

tgttgccttc 

gagggctaaa 

gaagatggaa 

aacggaaaag 

gaccagcgtg 

atatctgaag 

ttacgaatta 

ttcatttatc 



tttctgcctc 
aaaaaaggaa 
tcatggatgc 
ctgggaactt 
acatccaagc 
aagaataatc 
cgttacttcc 
ggccgggcgc 



cgctaccgcc 
gcaggttcca 
tgccaacttt 
tggtggaggg 
tgcccttttc 
tacatgattg 
aaattaacca 

ggt 



atggcgccca 
aagttcactc 
gagcagtttt 
gtagtgaccc 
caacaggtat 
gttgcgcgta 
ggacgaagag 



tgaaaaagct 
ttgatcgcac 
tccaagaaag 
atcgaaggga 
ttgaaatatc 
gttgctaaca 
gaggaaaacg 



tgtggtaaag 
ccaccccgta 
gatcaaaatg 
gcaagagcaa 
tcaccaaaaa 
gcaaacagag 
aggattaaat 



<210> 172 
<211> 586 
<212> DNA 

<213> Homo sapiens 



60 

120 

180 

240 

300 

360 

420 

443 



<220> 

<221> misc_f eature 
<223> Incyte ID No: 



LG : 449404 . 1 : 2000MAY19 



<400> 172 

atccgtcgcc 

tgcctccgct 

cccaaggagg 

gttgctcaca 

agggaaactc 

tcaccttacg 

attactgcac 

cctggtgccc 

gaggacgtta 

gaggactgta 



ccacggaagt 
ccggtcaccc 
agaacgtcac 
tctttgcatc 
tggttcggat 
ctgctatgct 
tgcacattaa 
agtctgccct 
ccccggtccc 
ggcgtcatca 



ctcggcggcg 
gtcgctccac 
ccttggaccc 
cttcaatgac 
caccggtggc 
tgctgctcaa 
gcttcgtgcc 
cagggcgctt 
cacggacagc 
ttactgcgtg 



ccggcgtgct 
gcaaccatgt 
actgtccgtg 
accttcattc 
atgaaggtga 
gatgtcgcac 
accggaggca 
gctcgttccg 
actcgcagaa 
ccatttgctg 



tgtgatctca 
cgaggaggaa 
aaggagagta 
atatcactga 
aggctgaccg 
agcgctgcaa 
acaagaccaa 
ggatgaaaat 
agggcggtag 
gctctt 



acccaagtcg 60 
gac c agggag 120 
tgtctttggt 180 
tttgtctggg 240 
tgacgagt eg 3 00 
ggagcttggc 3 60 
gacccctgga 420 
eggaegcatt 480 
gaggggaaag 540 
586 



<210> 173 
<211> 551 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> Incyte ID No: 



LG:449413 . 1 :2000MAY19 



<400> 173 

ccgccggcct 

agccggttcc 

tcaagaccca 

gcaatgagtg 

agattggtat 

tggtgaagaa 



ccaactctaa 
ttcctccgcc 
ccgcaggaac 
gaaaaaacct 
tgaggccaag 
tggaaagaag 



ccccgttgct 
accatgggga 
cagaggtggg 
tttgctggat 
cagecaaatt 
attgetgect 



tccggctccc 
agacacgtgg 
ctgacaaagc 
catctcacgc 
cggccatccg 
ttgtgccgaa 



gcagcgtcgc 
tatgggagct 
atacaagaag 
caagggcatc 
taagtgtgcc 
tgatggttgc 



cgaccttcac 60 
gggegcaage 120 
agecacttgg 180 
gttctggaga 240 
cgtgttcagc 300 
ctaaactaca 360 



85/228 
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tcgaggagaa tgta^atga ggtgttgatt gctggatttg gtcgraaggg tcatgctgtg 420 
ggagacattc ctggtgtcag gttcaaggtt gttaaggtgt ctggtgtgtc gctgcttgca 480 
ctcttcaagg agaagaagga gaagccaagg tcttagatca ctttcggtag tccagaatgg 540 
tgtaaactgc c 551 

<210> 174 

<211> 565 

<212> DNA ' 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG:450105 . 1 :2000MAY19 , . ' , 
<400> 174 • ' 

gccgtcggca gctcggccgc cgtactcccg ctaccgagct aggaggcatc accttcgccg 60 
atccaacatg ggtaagacac gtggtatggg agctgggcgc aagctcaaga cccacagaag ,120 
gaaccagagg tgggctgaca aagcctacaa gaagagccat ctcggcaacg agtggaagaa 180 
accctttgct gggtcatctc acgcaaaggg aatcgtcctt gagaagatcg gcattgaggc 240 
taagcagcct aactctgcta tccgtaagtg cgctcgtgtc cagctggtga agaacgggaa 300 
gaagattgct gcctttgtgc caaacgatgg ttgcttgaac tacatcgagg agaacgatga 360 
ggtgctgatt gctggattcg gtcgtaaggg ccatgctgtg ggagatattc ccggcgtccg 420 
tttcaaggtc gtgaaggtct ctggcgtttc cctcctcgct ctcttcaagg agaagaaaga 480 
gaagccgaga tcttaaaacg ctgcaagggt ttgggcctgg tggcgcaccc cacacctgta 540 
gcgctcacac aagccatagc agacc 565 

<210> 175 i 

<211> 336 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 460809 . 1 : 200 0MAY19 
<400> 175 

gcgcatgggt ggagtgggct tcaagaagtg cccctcgggc acacagagag atccagaaat 60 
tcgccatgaa ggagatgggg actccaaatt tgcacattga tgtgaggctc aacaaagctc 120 
tctgggccaa aggaataagg aatgtcccat accatatcca tatgaagttg cccagaaaac 180 
ttaatgagga tgaagattca ccagacaagc tctatgcttt ggttcctaca tatacctgtt 240 
accactttca caaatctata gacaggcaat gtggaagaga gctaaccact gatggttcaa 300 
tacattaagt aaaattattt ttaaaaaaga aattta 336 

<210> 176 
<211> 932 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 481781 . 1 : 2000MAY19 
<400> 176 

ggcggagcgt aaggcgaagg agtagcagca gcaggcggcg ccgagtagcg gctccccatc 60 
tcgagcttgc caccatggct agaggattga agaagcattt gaagaggctc aatgccccca 120 
agcattggat gctggacaag cttggcggag cttttgctcc caagccatct tctggacctc 180 
acaagtctag ggagtgcctg ccactgatcc tcatcatcag gaacaggctc aagtatgctc 240 
ttacataccg tgaggtcatt tccatcctga tgcaacgcca tgtacttgtt gatggcaagg 300 
tcaggacaga caagacctac cctgctgggt tcatggatgt catttccatc cccaagacca 360 
atgagaacta caggctgctg tacgatacca agggccgctt ccgccttcac ccaatcaggg 420 
atgaggatgc taagttcaag ctttgcaagg ttaggtctgt ccagtttggg cagaagggca 480 
tcccttatct gaacacgtat gacggccgca ccatccgcta ccctgacccc ctcatcaagg 540 
ccaacgacac catcaagatc gatctggaga ccaacaagat tgtggacttc atcaagtttg 600 
atgttggcaa cgtcgtcatg gtgactggcg gaaggaacac tggccgcgtg ggtgttatca 660 
agaacaggga gaagcacaag ggcagctttg agaccattca cgtggaggac tcctgggcca 720 
ccggttcgcc acccgtatgg gcaacgtgtt caccatcggc aagggtaata agccgtgggt 780 
gagcctgccc aagggcaagg gaatcaagct gagcatcata gaagagcaaa ggaagcggga 840 



86/228 
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tgccgctgcc caggctgctg ccaacgcata aatcctctga atgtgctgtg ttgagagttt 900 
tttctagttg ctctgcaaga atatagaaca at 932 

<210> 177 ' 

<211> 733 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc — feature 

<223> Incyte ID No: LG: 1101153 . 1 : 2000MAY19 



<400> 177 

gcgaaggcga 

gccgggcaac 

gaacacaagg 

ccttgaggat 

tggtcgcacc 

gtcagccagg 

cttggacgtg 

gcgccgcacc 

tgagctgatt 

ggcaaggaag 

gccgtagctc 

atgttttgta 

aatttgatat 



ccattccttc 
cctaccaaat 
gagacagctt 
gttattgctc 
gcacaagcaa 
ttcatattgg 
gacaacctct 
taccgtgctc 
ctgtcagaga 
cagtgaagta 
ttttatgtac 
ctgtgctcga 
cct 



ttcgctcctt 
cggccaaggc 
ttgcgcttcg 
acaagcaggc 
agtctcgcca 
atttgctgaa 
atgtttcaca 
acggacgcat 
aggaagagcc 
tgttagcctg 
cttttcaggc 
gtgtaaggtc 



cgccgccgct 
catgggaagg 
caagctgcct 
aattcccttc 
ctccaatggg 
gaatgctgag 
catccaggtg 
taacccttac 
tgtgaagaaa 
ctgataatgg 
tggtaccgat 
agttcgtaat 



atggttaagt 
gacctgaggg 
ttgaccaagg 
cggagatact 
cagggtcgct 
agtaacgctg 
aaccaagccc 
atgtcctccc 
gaggctgaca 
catcctttcg 
accgtggatc 
acaatcaatc 



actcgcagga 60 
tccacttcaa 120 
ctaagcgata 180 
gtggaggtgt 240 
ggcctgttaa 300 
atgtgaaagg 3 60 
agaagcagag 420 
cttgccacat 480 
acattgtcgc 540 
aagcattata 600 
tggatttgct 660 
tgcgtggaac 720 
733 



<210> 178 
<211> 699 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc__f eature 
<223> Incyte ID No: 



LI : 25769 5 . 20 : 2000MAY01 



<400> 178 

ggcgccgcgc 

ctgcagtctg 

acgcggcaat 

gctacagtac 

agacatccgt 

ggaactgggc 

ctttaattgt 

tattaccaga 

tcaagtcaag 

ggatgaggct 

ggtagctgac 

ctaccagaaa 



ggtgaaggtt 
tgcaggtctt 
ggtctcatca 
aagctgctgg 
gtccgtgtaa 
gcatggagga 
gtgtctttct 
aatgtgagtg 
gactctgtgt 
tccaagaagg 
cctcgtcgct 
tcctaccgat 



gtctagtcca 
cggacgcaag 
aggtgaacgg 
agccagttct 
agggtggtgg 
ggtggctctg 
cacagctatc 
agcatgggtc 
gattatctaa 
agatcaaaga 
gcgagtccaa 
aagcccatcg 



cgcttcggag 
aagacagcga 
gcggcccctg 
gcttctcggc 
tcacgtgccc 
ggagggaggc 
cgtcagtcca 
cttcccatga 
atcctcgtcc 
catcctcatc 
aaagtttgga 
tgactcaaa 



ccatgccgtc 
cagctgtggg 
gagatgattg 
aaggagcgat 
cagatttatg 
cttcacagcg 
tctccaaagc 
ggtaggtggg 
ctgctcttct 
cagtatgacc 
ggccctggtg 



caagggcccg 6 0 
cgcactgcaa 120 
agccgcgcac 180 
ttgctggtgt 240 
gtgagtccca 3 00 
ctcctgtacc 3 60 
cctggtggcc 420 
tgtgtgggga 480 
tgccagatgt 540 
ggaccctgct 600 
cccgcgctcg 660 
699 



<210> 179 
<211> 568 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> Incyte ID No: 



LI : 455771 . 1 : 2000MAY01 



<400> 179 

ccgcttcttg 

cacgtggtat 

acaaggcata 

cccatgccaa 

ctatccgtaa 

tgccaaatga 

ttggtcgtaa 



cagccgtcgc 
gggagccggg 
caagaagagc 
gggcattgtc 
gtgtgctcgt 
cggttgtttg 
ggggcacgct 



cgcgaagcgc 
cgcaagctca 
catttgggca 
c tggagaaga 
gttcagcttg 
aactacattg 

gtgggagata 



tccagagaac 
agacccaccg 
atgagtggaa 
ttggtattga 
ttaagaatgg 
aggaaaatga 
ttcctggtgt 



ttccgtcaac 
caggaaccag 
gaaacccttc 
ggccaagcag 
caagaagatt 
tgaggtcttg 
ccggttcaag 



atggggaaga 60 
cggtgggctg 120 
gctgggtcat 180 
cccaactccg 240 
gctgccttcg 3 00 
attgctggat 3 60 
gtcgtcaagg 420 



87/228 



BNSDOCID: <WO 01 62927A2_I_> 



WO 01/62927 



ttcYgc 



PCT7US01/06059 



tttccggtgt gtctcYgctt gcccttttca aggagaagaa agagaagcca aggtcttaga 480 
ttgctcttgc taccaaaatc agcaagcgtg gagttgaaac gggagggcgt tagatgatta 540 
agaagaatgg ttgcttgcta tgtttgca 568 

<210> 180 

<211> 444 

<212> DNA . 

<213> Homo sapiens 

<220> 

<221> ihisc_ feature 

<223> Incyte ID No: LI ; 274551 . 1 :2000MAY01 



<400> 180 

gtcacatcct 

accactttac 

ttcttaaggt 

agcttgacat 

ttgtctaagg 

gtcttcagtc 

ttggcatggt 

catgttgcca 



tggcctgaaa 
aaagttttcc 
catcaacctt 
acataggctc 
ctttggcagc 
tcttgtagag 
ggtgggtaca 
tgcagagaaa 



ttctttgctg 
ttgtctatca 
aattgggttg 
ttcacatttg 
tttgcaggtt 
cagtattaat 
ggtgaaggtg 
gaga 



tagcccttca 
gtfcttacaaa 
atttggtgtt 
gatgcaagca 
ccacctgcta 
atccattaca 
aatcttgagc 



ctgtgacaca 
ggcttagcca 
cggcacaaag 
cgcaaagatg 
ggccatcatg 
cctccagcag 
cactcaactt 



actgcaacca 60 
tgtctctggt 120 
ggccacccct 180 
ggcttggccc 240 
gctgagggcg 300 
cgtgtcttcc 360 
ctgctctgag 420 
444 



<210> 181 
<211> 779 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> Incyte ID No: 



LI : 035973 . 1 :2000MAY01 



<400> 181 

tatagggtaa 

ttccggataa 

agaatagctg 

ctgtgtccct 

agctaaggga 

gagggtgtct 

agtgtcgtga 

agaagtcaac 

ctgccacggc 

tactgcagat 

gttgtgtcat 

tttttaaaaa 

ttacatatat 



acatccaatt 
gggaagctgg 
ctgactgatg 
tgcttcagtc 
agagatttgg 
ggcaggtatc 
tatgatacac 
ctgtggcaaa 
taaaagaaaa 
ttaggcatgg 
ccagttcatc 
tgtatatatt 
gtgaaaagag 



aaaaggcaga 
tgggcaggaa 
cctgggggac 
tgatctttgg 
gaatgtttgg 
cgtaaagata 
aagttgtgct 
tgtggctccc 
taccacggcg 
attctttgca 
ttaagaattt 
ttaaacatat 
gcagagattg 



tattggc tat 
aaaggcatgc 
caagcccata 
ggagcctgca 
ccctgtgtaa 
ataagatgaa 
gcctctgtgg 
ctgccaagcg 
actggttgaa 
ggaacaacac 
caatgattag 
atatgtttat 
tcagattgga 



ccttggggag 
aatggccctc 
attgtggagc 
cagttctgac 
cagcaaaagg 
gggaacatcg 
ctctaaggca 
caagagaaag 
tgaagcacct 
ctacacccaa 
tcacacaata 
atgtatatgt 
ttaaaaagct 



gccagcaaat 60 
atgagccaga 120 
tggagccgca 180 
ctcagttaag 240 
attcattggt 300 
ccgtttggaa 3 60 
taccaccttc 420 
tgtaactgga 480 
aaacattgta 540 
gagggcagca 600 
aatattccgg 660 
tatatctgta 720 
gtctgtaag 779 



<210> 182 
<211> 524 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 978427 . 5 :2000FEB18 



<220> 

<221> unsure 
<222> 453 
<223> a, t, c, 



g, or other 



<400> 182 

gtggtgggta 

gtggattaac 

cctcactgtc 

agctctagga 

ctttgctgcc 



tgtgacggat 
cagtccttca 
aatgagttgg 
ctcaatgcat 
gtagctgctg 



gcctctgttt 
atgccgtcgt 
gaacagctta 
tgaccaagca 
ctaattgcat 



taggactacg 
caattacacc 
cgtttctgca 
tgtctcacca 
taatattcca 



ccggctgtgc 
aacagaagtg 
acaactggtg 
ctgataggac 
ttaatgaggc 



tgttctggca 60 
gagacgcacc 120 
ccgtagcaac 180 
gttttgttcc 240 
aaagggaact 300 



88/228 



BNSDOCID: <WO 01S2927A2J_> 



WO 01/62927 



PCT/US01/06059 



caaagttggc attcccgtca cggatgagaa tgggaaccgc ttgggggagt c^gcgaacgc 3 60 

tgcgaaacaa gccatcacgc aagttgtcgt gtccaggatt ctcatggcag cccctggcat 420 

gggcatccct ccattcatta tgaacacttt ggnaaagaaa gcctttttga agaggttccc 480 

catggatgag tgcacccatt caagttgggt tagttggatt ctgt 524 

<210> 183 ' 

<211> 2340 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_f eature 
<223> Incyte ID No: 



LG : 247781 . 2 : 2000FEB18 



<400> 183 

ggcaggggcc 

acctatgcgg 

tacagccacg 

agcacctgcg 

caagcctcca 

tcccaggagg 

ccagctgtga 

tccaggtgag 

ggagactgat 

atcctggatc 

cctggatcct 

ctagatccca 

tgctacccta 

aaatcactgg 

tactggatcc 

cccaacaccc 

cctcaggcac 

tccccatgcc 

cctcaccact 

ccagatcccc 

ttaacctcga 

actacgcctt 

ctggctccta 

agctgctgga 

cctcatctct 

ccggtttctg 

atcctggcag 

tgacatgaaa 

acctgcacgg 

ctgtggggaa 

cttccatccc 

tccgaggaat 

gaagggatgt 

aggagaaaat 

aaaaacaaaa 

agaggagcaa 

agcttaggat 

aggggctgga 

ctgcctgtgt 



ccgtgtctta 
gcatcgacct 
actcggcaga 
gccagatagc 
tcgagggtgg 
gcatgcgggg 
gcatctccta 
ggaccqggag 
gatccaacca 
ctagactcct 
agactcctat 
gatccccctg 
accacaagat 
atcctgcagc 
tggacctcta 
caaccacagg 
cagatccagt 
tcgagccctg 
ggatcctgac 
tcactccacc 
cact^gggtct 
ccacccccag 
aaacaagtgc 
ttccagattc 
tgaaggcaga 
atctgaatct 
cccagtcaca 
ttgcatggtc 
atgctggatt 
actgttccct 
cctcccacct 
cactcactcc 
ccccacccca 
cggcaaagac 
aaaaaccacc 
agtcctcctc 
catctggaga 
ggacttccgc 
gtgttatttc 



ctagcgatgc 
ggccgtctac 
cccaggcatc 
cagttacccg 
cccccagctg 
cctctaccgg 
tgtggtctac 
cccgtccccc 
caggatccct 
atgccccaac 
gccccaacca 
taaccataac 
tcaagattat 
cccacatcct 
cgtcttaacc 
agcacggatt 
gtcctagtga 
gatctccaag 
aaccacaatg 
cactggattc 
cagatccttc 
gcactggatc 
gggggcccca 

tgggtcccca 

tcctgatatt 
tccaggcgcc 
gagtatccca 
gagggagtct 
ccccctcacc 
ggaaccactc 
tctaggcgaa 
tggaggctgg 
gcactatcag 
cacccttccc 
aaatccccaa 
ccctgcgccc 
ccccatggag 
acgcttccac 
aaaggaaaag 



cgtatctccc 
gagactctga 
ctcgtgctcc 
ctggccptgg 
tccatgctgg 
gggatcgccc 
gagaacatga 
caatccctca 
actctttggc 
cattgggtca 
ctgggtcatg 
tgtggatccc 
ccacacccca 
aaggtggatc 
actggatccc 
ccctgtacct 
aacgctggat 
ctcagctgct 
cctggatcct 
ctgcattggt 
tgctgactgc 
ccagattccc 
gtggcacagc 
atccctctgc 
ccaaggcact 
gggtcccaaa 
cacacactgg 
gtggggaagg 
ccacctctgc 
cactccctgc 
tagtccccag 
caaggagaca 
ggcctcccca 
taaacccaag 
attcattcca 
ttacattctg 
ggacttggaa 
ctccttcgac 
aacaaaagga 



caacagtgct 
agaactggtg 
tggcctgcgg 
tccggacccg 
gtctgctacg 
ccaacttcat 
agcaggcctt 
ccccccacac 
cacgagatcc 
t&ggatccca 
cgatccccac 
ttacttcagc 
gcccttaatc 
ccacgcttcc 
acacaaatca 
caacacccag 
cctagatccc 
ggattctgga 
ggggccccca 
ttttggtttc 
cagatccctg 
aagccttcac 
aagtggatcc 
ccagtccctc 
gaatcccaag 
tgttcaggcc 
tgcccagagc 
aagcccaggt 
atggccaccc 
atccccacac 
agctgtgttc 
gtctgaggcc 
ggcttccaga 
cacccaatga 
gatctatttt 
cacttcatag 
aggggaactg 
ctccactgcg 
ataaattttc 



gggtatcatc ' 

gcttcagcag 

taccatatqc 

catgcaggca 

tcacatcctg 

gaaggttatt 

gggggtcacg 

ctcagccact 
cagtacccag 
gcatccagat 
ccttcagcca 
aactcaagtc 
cccatccccc 
ctgtgccccc 
gtgaatggat 
accctgcctc 
caaccccaga 
tgtcaacaaa 
tcactggatc 
ctgctccttt 
catttcaagc 
ccaccagatt 
tggcaactgc 
aatgttgaaa 
ccctgaatcc 
ccaagtctag 
cggcttctca 
cctggctgca 
cctcccagcc 
ttcacagcat 
ctccaagggg 
agggacacat 
gttgaaagcc 
tgcaaaaaac 
tctaccagag 
ttggattctg 
ggatttgggg 
ccccacctcc 
taagctcttt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 



<210> 184 

<211> 1199 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> Incyte ID No: 



LI:034583 . 1:2000FEB01 



<400> 184 

gtgcagcatg ccaccgtgat tcccgagacc atggcaggca cccagcagct ggcggactgg 60 
agaaatacgc atgctcacgg aagccattat aatcccatca tgcagcagcc tgcactattg 120 



89/228 



BNSDOCID: <WO 01 62927A2_L> 



WO 01/62927 i A ^te, PCT/US01/06059 



• # 

icwttcc agcagcacag cccttaaatg tgggVgt 



accggtcatg tgacWttcc agcagcacag cccttaaatg tgggVgtggc ccacgtgatg 180 

cggcagcagc caaccagcac cacctcctcc cggaagagta agcagcacca gtcatctgtg 240 

agaaatgtct ccacctgtga ggtgtcctcc tctcaggcca tcagctcccc acagcgatcc 300 

aagcgtgtca aggagaacac acctccccgc tgtgccatgg tgcacagtag cccggcctgc 360 

agcacctcgg tcacctgtgg gtggggcgac gtggcctcca gcaccacccg ggaacggcag 420 

cggcagacaa ttgtcattcc cgacactccc agccccacgg tcagcgtcat caccatcagc 480 

agtgacacgg acgaggagga ggaacagaaa cacgccccca ccagcactgt ctccaagcaa 540 

agaaaaaacg tcatcagctg tgtcacagtc cacgactccc cctactccga ctcctccagc 600 

aacaccagcc cctactccgt gcagcagcgt gctgggcaca acaatgccaa tgcctttgac 660 

accaagggga gcctggagaa tcactgcacg gggaaccccc gaaccatcat cgtgccaccc 720 

ctgaaaaccc aggccagcga agtattggtg gagtgtgata gcctggtgcc agtcaacacc 780 

agtcaccact cgtcctccta caagtccaag tcctccagca acgtgacctc caccagcggt 840 

cactcttcag ggagctcatc tggagccatc acctaccggc agcagcggcc gggcccccac 900 

ttccagcagc agcagccact caatctcagc caggctcagc agcacatcac cacggaccgc 960 

actgggagcc accgaaggca gcaggcctac atcactccca ccatggccca ggctccgtac 1020 

tccttcccgc acaacagccc cagccacggc actgtgcacc cgcatctggc tgcagccgct 1080 

gccgctgccc acctccccac ccagccccac ctctacacct acactgcgcc ggcggccctg 1140 

ggctccaccg gcaccgtggc ccacctggtg gcctcgcaag gctctgcgcg ccacaccgt 1199 

<210> 185 

<211> 1012 

<212> DNA 

<213> Homo sapiens 

<220> . 
<221> mis cofeature 

<223> Incyte ID No: LI : 333307 . 2 : 2000FEB01 
<220> 

<221> unsure 
-<222> 965 

<223> a, t, c, g, or other 
<400> 185 

aacagatgtg aaaaaagaat tatgacaagg acaacctaat ataagtttta taaattttag 60 
gaaatgtcat ctttttactg gtttcagaga ctttttcaaa tgttttaaac acaattttca 120 
gataacaata atgaaaagct gagccccaaa ccagggacag gtgaaccagt tttaagtttg 180 
cactacagca cagaaggaac aactacaagc acaataaaac tgaactttac agatgaatgg 240 
agcagtatag catcaagttc tagaggaatt gggagccatt gcaaatctga gggtcaggag 300 
gaatctttcg tcccacagag ctcagtgcaa ccaccagaag gagacagtga aacaaaagct 3 60 
cctgaagaat catcagagga tgtgacaaaa tatcaggaag gagtatctgc agaaaaccca 420 
gttgagaacc atatcaatat aacacaatca gataagttca cagccaagcc attggattcc 480 
aactcaggag aaagaaatga cctcaatctt gatcgctctt gtggggttcc agaagaatct 540 
gcttcatctg aaaaagccaa ggaaccagaa acttcagatc agactagcac tgagagtgct 600 
accaatgaaa ataacaccaa tcctgagcct cagttccaaa cagaagccac tgggccttca 660 
gctcatgaag aaacatccac cagggactct gctcttcagg acacagatga cagtgatgat 720 
gacccagtcc tgatcccagg tgcaaggtat cgagcaggac ctggtgatag atttaatatc 780 
agaggaacaa caataggtga tagaataatg agacgctctg ctgttgcccg tattcaggag 840 
ttcttcagac ggagaaaaga aaggaaagaa atggaagaat tggatacttt gaacattaga 900 
aggccgctag taaaaatggt ttataaaggc catcgcaact ccaggacaat gataaaagaa 960 
gccantttct ggggtgctaa ctttgtaatg agtggttctg actgtggcca ca 1012 

<210> 186 
<211> 318 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LI : 814710 . 2 :2000FEB01 
<400> 186 

cgaaggacct gaaggaaagt ggaaaagtcc aaagtttaag atgccagaga tgcattttaa 60 
gactccaaag atatccatgc cagatattga cctgaatctc acaggtccaa aaataaaagg 120 
agatgtggat gttacaggcc ctaaggtaga gggagatctg aaaggtcctg aagttgacct 180 
caaaggcccc aaagtggaca ttgatgtccc agatgttaat gttcagggtc cagactggca 240 
cctgaagatg cccaagatga aaatgcctaa gttcagtatg cctggcttca aaggagaggg 300 



90/228 



BNSDOCID: <WO 0162927A2_L> 



I 



WO 01/62927 PCT/US01/06059 
cccaggttct agatcgcg 318 

<210> 187 
<211> 677 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: LG: 414732 . 1 : 2000MAY19 
<220> 

<221> unsure 
<222> 45 , 618 

<223> a, t, c, g, or other 
<400> 187 

agaggagcga aaagcatggt ccggggggcg gcggacgcct ggtangcgct cgggccgttg 60 
ttgtggggct gcgcgctggc gctgcagggc gggatgctgt acccaagaga gaggccgtcg 120 
cgggagcgga agaagctgga ccgcctctgg agcttccgcg ccgacttctt cgacaaactg 180 
tgccttggct tctaggagta gtggtatcgg cgtctgctgc gagagtcggg ctccaccatg 240 
gacataccgg ttccctccag cttcaacgac gttggccagg actggcggct gcggcatttt 3 00 
gtagaccaga tgtggtacga acgggaggtg accttcctgg agcaatggac ccaggacctg 360 
cacacaagag tggtactgag gattgtcagt gcccactcct atgccatcgt gtgggtgaat 420 
ggggtcgacg cgctagagca tgagggatct acctcccctt tgacaccgac atcagtagcc 480 
tgttccaggt ggggcccctg ccctcccgcc tccgcatcac tatcaccatc ggcaacatgc 540 
tcatctcctc caccctgcca ccagggagca tcctcgacat ggccgacacc tccacgtggg 600 
taccatcctg cttccacngc agacacccac cttcctgtcc caccccgtgg ggcattacat 660 
tagggtaatt acattag 677 

<210> 188 
<211> 1295 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: LG : 413910 . 6 : 2000MAY19 
<220> 

<221> unsure 
<222> 1275 

<223> a, t, c, g, or other 



<400> 188 

gatttggctc ccagctttgt agttcggaag 
tccattgatg tgttgagctt cagagggaat 
caaggagttc tcctgcatcc ttatggcgtg 
cctggactga ttcagggtaa tcaggaagca 
tacgcttcgg cccagtttgc tcccccgcag 
catccccacc ccgcgccaga gtacacaggc 
ctgtaccctc ccgcccagac gcactccgag 
acacgtgctg tggcattatt tacctcgatt 
ttctccctcg ccctgtccgc atcggcacta 
gagcctcgct cattaaaaat aatttcaaca 
tgtctgaggt ccggatcccg gaacctgaag 
tgagacttgt tttccgtgtt ttcagaaggt 
gcgtcatctg tctgtgtggc ggtgccagag 
tgctcggagt gcgtctgggc gggagggtac 
tggcctgtgt actctggcgc ggggtgggga 
ttctgcgggg gagcaaactg ggccgaagcg 
gctgcttcct gattacccct tagctgctct 
tttctgtctt ctatttccac gcaggtattg 
cccacgactg caattagctg ccaagatttc 
tcaccgggaa aacacagcac gggagaagtc 
agcaatgtaa atgactccag tcatctgtcg 
ctgcctggca cgacngccag caggtgcaga 



aagttgggtc tatagatttc cccctaactc 60 
aataactcta cgt'aaagcat gctggcgtct 120 
cctatgattg taccggcagc tccttacctt 180 
gccgctgccc ctgacacaat ggctcagcct 240 
aacggtatcc ccgcggaata cacggcccct 3 00 
cagaccacgg ttcccgagca cacattaaac 360 
cagagcccgg cggtcttttt atttgtoatt 420 
ttacggccct ctaccacggt gccgtgtaat 480 
ttttcgaaag ttacgaaacc aaatcccttt 540 
tctaagattt taccaaattg accaaacatt 600 
gggatattgg agacatgcag ccgcttgggc 6-60 
tgtgtctggg gctggccatc cgtcggtgct 720 
acggtctgag cgctcgtgtc cgccgggctc 780 
aggtttaatg tgtgctcggg aaccgtggtc 840 
tgaggggccg tgtattccgc ggggataccg 900 
taaggctgag ccattgtgtc aggggcagcg 960 
ctttcacaat tcatctgttg tcttgaaacc 1-020 
cagtctgctt tgatatcctg aagggcatct 1080 
attgtcggct gcaaagtgcc tcctgtttcc 1140 
cgaggcaaca gcaaacggca cccacgtgct 1200 
ttgcgcgccg <:gcgccttcg ctcgcccggg 1260 
acgcc 1295 



91/228 



BNSDOCID: <WO 01 62927A2_I_> 



WO 01/62927 



PCT/US01/06059 



<210> 189 
<211> 751 
<212> DNA 

<213> Homo sapiens . 
<220> 

<221> misc^feature 

<223> Incyte ID No: LI : 414732 . 2 :2000MAY01 
<400> 189 

gagagaggag cgaaaagcat ggtccggggg gcggcggacg cctggtaggc gctcgggccg 60 
ttgttgtggg gctgcgcgct ggcgctgcag ggcgggatgc tgtacccaag agagaggccg 120 
tcgcgggagc ggaagaagct ggaccgcctc tggagcttcc gcgccgactt cttcgacaaa 180 
ctgtgccttg gcttctagga gtagtggtat cggcgtctgc tgcgagagtc gggctccacc 240 
atggacatac cggttccctc cagcttcaac gacgttggcc aggabtggcg gctgcggcat 300 
tttgtagacc agatgtggta cgaacgggag gtgaccttcc tggagcaatg gacccaggac 360 
ctgcacacaa gagtggtact gaggattgtc agtgcccact cctatgccat cgtgtgggtg ,420 
aatggggtcg acgcgctaga gcatgaggga tctacctccc ctttgacacc gacatcagta 480 
gcctgttcca ggtggggccc ctgccctccc gcctccgcat cactatcacc atcggcaaca 540 
tgctcatctc ctccaccctg ccaccaggga gcatcctcga catggccgac acctccacgt 600 
gggtaccatc ctgcttccac cgcagacacc caccttdctg tcccaccccg tggggcatta 660 
cattagggta attacattag gagaacttgt tgtttaaaag agtgtggcac cttcatccac 720 
ctttttcctt tcttgccatg tgacacacca g 751 

<210> 190 1 
<211> 4466 

<212> DNA i 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 9 00264 . 2 :2000MAY01 
<400> 190 

tgaaagtact ttgactccca agggtatctg atctgatgga gagaatgtat cacttgtacc 60 
gatgcggata gaacgctcat taattacagc acagttttca taaattccaa cttccctagg 120 
aattttgtct tttgaaggca tgttggaagt tggggcatta ggatataatc tcagtccatg 180 
tactaagcct ctgcataact gaaagataga gtcaataaaa ttcactctct cattttttgt 240 
ttttaatctc aagaattttt gccttgaagg acgaatgtag tggcgaattt gatgtttgaa 300 
cgaatcccag tcagaagttc cagcctgcca ctgttctctg atgccatgct atgaccaaca 360 
tcaagctgta ttgtcctctc atccgtaact gtgaactgag caggatctat ggcactgcat 420 
gttactgtca ccagaaacat ctctgtttgt tcctcatcgt acattcctca gagtcgactg 480 
agatacacac ctcatccagc atatgctacc ttttgcaggc caaaggagaa ctggtggcag 540 
tacacccaag gaaggagata tgcttccaca ccacagaaat tttacctcac acctccacaa 600 
gtcaatagca tccttaaagc taatgaatac agtttcaaag tgccagaatt tgacggcaaa 660 
tagtgtcagt tctatcctgt cgatttgaca gcaatcagct gcctgcaaat gcacccattg 720 
aggaccggag aagtgcagca acctgcttgc agaccagagg gctccttttg ggggtttttg 780 
atggccatgc aggttgtgct tgttcccagg cagtcagtga aagactcttt tattatattg 840 
ctgtctcttt gttaccccat gagactttgc tagagattga aaatgcagtg gagagcggcc 900 
gggcactgct acccattctc cagtggcaca agcaccccaa tgattacttt agtaaggagg 960 
catccaaatt gtactttaac agcttgagga cttactggca aggagcttat agacctcaac 1020 
actgggtgag tcgactgata ttgatgttaa ggaggctcta attaatgcct tcaagaggct 1080 
tgataatgac atctccttgg aggcgcaagt tgggtgatcc taattctttt ctcaactacc 1140 
tggtgcttcg agtggcattt tctggagcca ctggcttgtg tggcccatgt ggatggtgtt 1200 
gaccttcatg tgggccaata ctggcgatat cagagccatg ctgggtgtgc aggaagagga 1260 
cggctcatgg tcagcagtca cgctgtctaa tgaccacaat gctcagaatg aaagagaact 1320 
agaacggctg aaattggaac aatccaaaga gtgaggccaa gagtgtcgtg aaacaggatc 1380 
ggctgcttgg ctttgctgat gccatttagg gcatttggag atgtcacgtt ccaatgggag 1440 
cattgagctt tcaagaagag agtgatagaa tctggcccag aaccagttga atggacatga 1500 
attttccaag tttattcctc ctaattatca cacacctcct ttatctcact gctgagccag 1560 
aggtaactta ccagcgagtt aaggccacag ggataagttt ctggtgttgg ctactgatgg 1620 
gttgtgggag actatgcagt aggcaggatg tggttaggat tgtgggtgag ttcccttaac 1680 
tggcatgcat caccaacagc caatagctgt tggtggctac aaggtgactc tgggacagat 1740 
gcatggcctt ttaacagaaa ggagaaccaa aatgtcctcg gtatttgagg atcagaacgc 1800 
atgcaaccgc gatcgtcatt cggccaccgc tgtgggcaac aacgagtttg ggactgttga 1860 
tcattgagcc gcttctctaa aatgcttagt cttcctgaag agcttgctcg aatgtacaga 1920 
gatgacatta caatcattgt agttcagttc aattctccat gttgtagggg cgtttcaaaa 1980 
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BNSDOC1D: <WO 01 62927 A2J_> 



WO 01/62927 ^fe. PCT/US01/06059 



tctcactggc aattctcaaa tgatacacat txaa 



cccagaatag tgagtgg;ct<: tttcactggc aattctcaaa tgatacacat ttaaagggca 2040 
gattttctta aaaagatact actataataa acatttccag ttggtcattc taagcattta 2100 
cccctttgga tactctagct agtcaggtac . tccaaattga cctttgcagc agggtggcag 2160 
ggtcaggaga gtctggtcct gcctagctca gatttcagtg gcagcctgca cttgaagcaa 2220 
cgtcacttct ttatcacagg tcgtcttgag acattcagct tcttttacca ' aacctgagaa 2280 
aattaggatg cacctggcca aataagatct tgaataggcc aaaagcaagt atcttgctgt 2340 
gtgtagtctc ttggttcaag tgaagcaaca gtactgttca cacctttctt cactgagatt 2400 
cccagtgtac atgagaacat atatttattg catgattttc ctagatacac agtcctatgc 2460 
attattcata tacatttatt ttagcctaaa agtgggtttc aaatcccagt tccttcaagc 2520 
cataaatgac caaggtccaa ggcaatctgg aatttgtttt. tgtgattatt tgactggaat 2580 
gcttcttaag tggaataacc tatactccgt tatccacccg atttcctaat gtaattgaaa 2640 
gattttccta ttttgccaca cacttggaga acaataatgg gtttttagtt ttatctactc 2700 
ttctattgaa gttaaagaaa gaaaaaaaga tttttttatt tggtattaat gaaaagcttt 2760 
agtttaaaat aaggagatcc agaataaaaa gaagagactg atctcttcaa ttattgtcat 2820 
ctgtagccac cagcacatca ctcttatgta atccccaaag gcttggcatg ccgtaagtgt 2880 
gtggtgggta gactgctgcc ggggaatcgt acttcttatt tagtaatgat aagacttttc 2940 
attaattttg ccatttttaa agatgacata aataagttta aatatcaatt tggggagtaa 3000 
ggtttaatat tgccatcggg tattgagaca ggaggaagtt tctgtttttc tccatttaga 3 060 
ccataggtca attaaaatat ttgggtttaa aaattgacct aaatgcttta aacatattgt 3120 
agcttaagat atatgtgtta agatatatac atgagaaact ttaaaaggta actactgtgc 3180 
atgcctgatg cttaatagaa tacttagtgg, catcaaatgt ttgcagcagt ctccatgagt 3240 
tatgttcggt cccttctaat actgtatcaa tgtaaatgaa ataaagtata ttcaaattgg 33 00 
ctttttgata tgcatca»agt ggcattttgt ctcctgtgtt taatagtgat ctgtatacag 33 60 
gctgtgcaca tattgtcatc acttattcta gcatcactgt taaaggctgt gattatgttt 3420 
gatattcacc tgcgatttta atacaagcca atatcagctt cccattgtgt aataacttgg 3480 
gtgtttagga gtcttttcac attttttggc gatatgaact agatgttcca agaaactcct 3540 
tcctggactg tggatactga atcagtgtac tattggctgc agaatttgtt tcaattgaaa 3 600 
atagactcag gaagattgct gctcagaata tcatatagat gtctattttt tgaggtgttt 3 660 
ttgtttttat ttgtgtgttt ttcttttttt taagtcagct tggaactttt ttcctgggta 3720 
gtatttggga gagggaaagg ctgtcactat atatttattt ctctaatgtt ttggctcggg 3780 
cctttttctc tttaatgaga atatagtgtg gctggtctca tgacaaaacc catattacag 3840 
tctactatga gaatattctt gaacaccacc actgaagagt ttcatataca ccaaataatg 3900 
tctcatctct atagtacagg gaatataaaa ttggtttcct gtggtcatga tccagatagt 3 9 60 
aggtattatt accacaagaa cttgggtctg ccagtcctgg aaagcttgtc tgctctatag 4020 
aaatgaaaat gcagcatgaa gttgacattg tggaaatgaa aagtaattgg gtattagaaa 4080 
tctgaaagta ctgtccatct aaaagcaatt gtggatttta ttgtaattgg ttgtcccctt 4140 
ttgttccggt ttcttagaat taaaagaata catgtaaacc tttccatggt atttaagccc 42 00 
tttccttaaa tttttttaaa atttaaactt tcctaaccct atgtattcaa cttctgtatt 4260 
tatatttaat ccagtggttc atgttatata atacaccctt aactagttaa atggaatgtt 4320 
ggtatggtac agagtaccat attgctaaga aaactgtctt ataaaagatg tatatggtgt 43 80 
gaagacatga aagtttaatg taacagaatg gttggagaaa tgcctatggt gaattaaagc 4440 
ttcatatctg ctttctgaaa aaaaaa 4466 

<210> 191 

<211> 1798 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: LI :33 5593 . 1 : 2000MAY01 
<220> 

<221> unsure 

<222> 111, 145, 147 

<223> a, t, c, g, or other 

<400> 191 

aaccagcagc gggaggagcg caacgctagc gcggcgcggg gaccgccgcg cttccttccc 60 
ggctcctgcc ctcctctgcc gctcgctcct ctctcggcat ccacctcctt nccctcctcc 120 
tgctcattca ctctttccct cactntnccc tcctcctcgg cttctccccg cgccccgcca 180 
gccctcgcgc tctcccactc cctctgcctg tcctccccgg tcccctcctc ccgctcattc 240 
actcgcccct ctcccttctc cactctctcc ccctgtctcc tttcttcttc ttctttcacc 300 
ctccgtctct cacaccccct ccattcccct gtctcctttc tgacactgca ctgcagctgc 360 
tcctcagccc tgccccctcc ccagtgagaa caaaccagca acattgcttt ttttcctaaa 420 
gagatttata ttgatccgat taaaaaaaaa aaaccttaag aaacccccga cgcccgaaag 480 
aaaaagaaaa aaaaaaagaa aaaagaaaag aaaaagccaa aacaaaaggg agaaccttct 540 
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f 

gcaggi 



cccggtagca gcggc^ggaa ctgcaaacat gatggcggca 
gacccacact ggggttccct atagacctgg acccgccgga 
aagccccccc tgaagccggc agcaccaaga ggaccaatac 
ttctaaaggg ttctcatacc tagttatgct gctggatcta 
gacaattgtt cagttgcaaa aagaaactgg agccaccatc 
tttttaccca ggtactagct gcagcgagtg tgctgtgagt 
tgaatgcatg gttcatggca cttcattgca tcagacacat 
aaatatggcc aagacagaac cagtcagcat tctacaaccc 
tcgcatcaaa caaacatttg ccatcttccc caatctacca 
cccatgagcc ccctccagag ctaatcaggt acagtatcat 
tttcaccggg tgattgcctg caaaactgga gcaacacaag 
ttcattcgtt cattcacata tatgtcacca tagtcagatt 
aatatctcat tattttctca acattttcta tgcctttcta 
tattcagata ttataactac tctaattttt taattaattt 
ctaatggggc acctgaagcc tatttattga aattggtaat 
ate t tec tag gaatttttca aaagtaaatt aaggatatct 
gtataaattg agcaaaggta tatctttaaa agtgaatgca 
gagacatgee ttatgatgta aacaactttc aactattgat 
taaaagaaac tgaaaattgg tatttgtggt aattttaatg 
tgttttctta gtattttaaa aaattctaag cattttcatt 
caagtagaat atattttatt agtttggatt tatggattgg 

<210> 192 

<211> 3690 

<212> DNA 

<213> Homo sapiens 



PCT/US01/06059 



gctcccatcc 
etcgeggaaa 
gggegaagae 
taattgggga 
aagctgtcta 
ccagggaact 
agttcgagaa 
cagaccaccg 
ccaagtcctc 
cctgtcatct 
atcatggtgg 
tccaataacc 
attttagata 
ttccctaaaa 
ttttaatctc 
atatatgtaa 
attttatttg 
gaaaagcatt 
tttaacaact 
getctgetga 
tttgggtact 



ageagaaegg 
aggcegctgg 
ggecagtatt 
agggaggaca 
agtccaaaga 
gttgaagtgc. 
tatgecctat 
ttaatccaga 
tccatctgat 
acaccatact 
gttccaggat 
tatagaaaga 
acaaatacta 
atttgactac 
cccttattgt 
aaaaaaaaat 
ctttttattt 
tctagcaatc 
caggagatgg 
caaactttgc 
atttgtag 



600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1798 



<220> 

<221> misc_feature 
<223> Incyte ID No: 



LI : 1189543 . 1 :2000MAY01 



<220> 

<221> unsure 
<222> 2889 

<223> a, t, c, g, or other 



<400> 192 

gaattccgac 

atgatggcgg 

gacccgccgg 

aggaccaata 

gctggatcta 

gccaccatca 

cgagtgtgct 

gaaaaaattc 

ccccagacca 

accaagtcct 

gttcccaaca 

atggagcagt 

gagagggttg 

atccagaaga 

gtgacaggtc 

agtgttacca 

tgcagccttt 

tgcacttaat 

tcaagcagca 

ageagecaat 

tggtacggcg 

atattttgga 

agatggatcc 

tggcaaagga 

ctccaaaaaa 

accagctgca 

agttcgggct 

ttaacccctc 

tttctgttat 

taaataatat 



aaaacaaaag 
cagctcccat 
actcgeggaa 
egggegaaga 
taattgggaa 
agctgtctaa 
tgatccaggg 
gagaaatgee 
ccgttaatcc 
ctccatctga 
gcacagcagg 
caggggcttg 
tcactgtgag 
tacaagagga 
cagtgggcaa 
actgetgeag 
ccagcagttt 
acattageca 
gcaacagggg 
ttattggcca 
gggacatttg 
gctgcttctc 
aaggatgtag 
gggaaaacat 
ggagaattcg 
acacaggctg 
gccaatcctc 
ctttacccca 
taatatataa 
gtgttttacc 



ggagaacctt 
ccagcagaac 
aaggccgctg 
eggecagtat 
gggaggacag 
gctgtctaag 
aacggttgaa 
ccaaaatgtg 
agategcate 
tcccatgacc 
tctgataata 
ggtgcagctt 
tggagaacct 
tccacaaagt 
attccaatcc 
cagctgeagg 
tatctggctt 
gctatggata 
ctttggctgc 
cctatgccag 
cattaggtag 
ccctagctgc 
ttgaaatagc 
tagtggaata 
tacctggcac 
ctcaatattt 
agaaagtggg 
ttttcaagaa 
ttatgcaaat 
agatgtttca 



ctcccggtag 
gggacccaca 
gaagcccccc 
tttctaaagg 
acaattgttc 
tccaaagatt 
gcactgaatg 
gecaagacag 
aaacaaacat 
acctccagag 
gggaagggag 
tcccagaaac 
gaacaaaacc 
ggcagctgtc 
aaceggatet 
gctattagga 
cacaggcaat 
taatctcaac 
agcagctgcc 
tgaagectea 
cctggctgct 
cagtgecatt 
agtgccagaa 
ccaggagttg 
aaggaategg 
aattacacaa 
ttgagtgccc 
ggatgtactg 
gaatgegact 
tagaaagaat 



cageggcagg 
ctggggttcc 
ctgaagcegg 
ttctcatacc 
agttgcaaaa 
tttacccagg 
cagttcatgg 
aaccagtcag 
tgccatcttc 
ctaatcaggt 
gtgctactgt 
ctgatgggat 
gaaaagctgt 
tcaatatcag 
ecttatgeaa 
catgctaacc 
gacctggtgg 
actttaggtt 
agtgccaacc 
gccagtggca 
getactgetg 
ctaggaacag 
aacttagttg 
actggtgcaa 
aaggtaacca 
aggatcacat 
cagttacaca 
tactttgeag 
atgttgacaa 
tttttcttga 



aactgeaaac 
catagacctg 
cagcaccaag 
tagttatgct 
agaaactgga 
tactactgag 
attcattgea 
cattctacaa 
cccaactacc 
aaagattata 
gaaggctgta 
caacttgeaa 
tgaacttatc 
ttatgccaat 
acactgetga 
ttgctggcgt 
ccatcacctc 
taggtctcag 
cagcagcagc 
gcacagctgg 
caaccaatgg 
aaaagtccac 
gtgeaatact 
ggatacagat 
ttactggaac 
atgagcaagg 
tcagattgtt 
aagtgaagtt 
tgtgtatatg 
tctgttttgt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 
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BNSDOCID: <WO 0162927A2_I_> 



WO 01/62927 



tctctatact 
ccattttacc 
tttcagtatt 
cagtgaccta 
ggcacagtga 
accgccatat' 
ccattatatg 
atggaaataa 
agttctcaca 
tcactactga 
gtttagacac 
caacaaatta 
cattttaaga 
aatgtagatt 
ttgcagcatg 
ggagcatttt 
cttatcttga 
taagcctttg 
aactgtgcna 
actcataggg 
gcggcatgca 
gaattacaaa 
caagctaaat 
gatgctccat 
actttccctt 
aatttacttt 
taataattca 
cattggactg 
ttaaaaggga 
attggtaatg 
atctcgtgga 
atgtgcttag 



ttgcttgtgt 
agcattattg 
agttctagat 
aaatggagtg 
cagtcggaaa 
acgttgccct 
ggtaagttta 
tgctjgcatgc 
tggagtccca 
tcccatcgcc 
ataactgtgg 
ttgatagctg 
gcttataatt 
ttactgtgaa 
gcgctagcag 
tatgtcacca 
gaggtagtgt 
ttbccaagac 
ggccatttga 
gcatcctagc 
gtctaatcat 
agagaattac 
ggcatcccag 
gtctgaattt 
ttgttcattg 
tggacattga 
tttctgtcaa 
ttatttcatt 
atgatatgcc 
actatttatc 
atcagtggtt 
ttcatcatgt 



atatttgtca 
tagtttaatg 
aacactaaat 
aggccaaagc 
catatttcga 
acaacgccct 
cagatcaata 
tggccatggc 
cctcttcaga 
tggccagtag 
tacagtgtat 
ataatgtttc 
gtgtataatt 
gctatctgtg 
ccaatgcagt 
accagacagt 
tatctacctt 
ccaacaactg 
cacaccatct 
cctgtgcact 
gacaaataat 
ttccatctca 
ctgctccttt 
cgtccagcct 
aactatatgg 
gaacttgaaa 
actgtaagag 
tagagttctc 
ttgtttaaat 
gtcgatactc 
agcattgccg 
aaaaaaaaaa 



gaggtgtttc 
attgaatgta 
taactactgt 
actgtcctgt 
aactaacgtc 
gtcaactatt 
aaagcacctg 
ccattcttca 
ggtfctgtaca 
aacagtcatt 
tggaaatttt 
attatctgca 
tgttaacact 
atttagctgt 
ttctaatact 
atttcctgca 
tgtcaggctt 
tcaccctctt 
gttaacctct 
cagcactcat 
gctgctactc 
acttcaaccc 
ctgtgcaacc 
ctctctgaac 
actctgtcat 
ttatttcctg 
tagactcatt 
tatctctaaa 
gaaagagaaa 
tgtagctgtg 
ctattatatt 



PCT/US01/06059 



tagtgtaaga tttaagcctg 1860 
gacagggata tgcgtatagt 1920 
taggttgagt atggtggggt 1980 
aagtcttact tcctgcttag 2040 
tcaaatttaa aatgatcaaa 2100 
ctacacggtt aaactgtgga 2160 
ttctccatct gaactagaca 2220 
tcatttgtaa gttcaacaaa 2280 
tttgttttta agcactgaat 2340 
actccattaa catcctcact 2400 
ataaacaaaa gtgaaagtgc 2460 
actgcttgat aagtatgttg 2520 
agaaacbtat tagtattgtg 2580 
ttgctcccat gatggagtct 2640 
cggtaatttg catgttttgt 2700 
tgcttattta gaagaggcag 2760 
tttgacaggt catttcagag 2820 
ctgtacctct cctgagtgcc 2880 
gagtttgccc actcaaggcc 2940 
aggatcatcc agactctcat 3000 
tgatatctgg ctgagcaact 3 060 
attgattacg tccatcctag 3120 
aattaaagaa caatgagtgt 3180' 
tgtgatcttt gtcctcatga 3240 
ttcatattga tttactgtgc 33 00 
atcccttccc cttccactat 3360 
tttttttttt tagtttttaa 3420 
tatttattta gagaatgatt 3480 
agctgtagta aactgtgtta 3540 
taagttttga caaatagtgt 3 600 
tactcatttt atcattataa 3 660 

3690 



<210> 193 

<211> 588 

<212> DNA , ■ 

<213> Homo sapiens 

<220> 

<221> mis cofeature 
<223> Incyte ID No: 



LG: 455450 . 1:2000FEB18 



<400> 193 

ggcggcggca 

gacgacggca 

ccaggaggcc 

gggtacgtgg 

atcaaggaga 

gcaaccggtg 

actgaaggca 

cagcactaag 

ccggttggcg 

cggcagctac 



tccttcgccg 
tgggtgggcg 
acagggaagg 
ctgggccgac 
agatcccagg 
gcggcggtgg 
ctcacgagaa 
cactaagcag 
actgcctgta 
gtatgtgtgt 



ctccggcagc 
ccggaagaag 
ccagggccag 
gaccggaggc 
cggccacaag 
ctatggcgga 
gaagggcttc 
ctgcctcttt 
agtacatgta 
gtattcgtgt 



agctccagct 
ggcctgaagg 
gcgacggcca 
ccccacgaga 
gactacgacc 
accaccgaca 
atggacaaaa 
cctcttccgg 
tacagaataa 
gtatgcattg 



ccagcagctc 
agaagatcaa 
ccggtgcgta 
agaagggtgt 
agcatcagca 
ccacgtatgg 
tcaaggagaa 
ctagctgcct 
ataaagcaat 
taaggcag 



gtcatctgaa 
ggagaaaatg 
cggcgggaca 
ggfcggagaag 

caccaccgcg 
gacgacgacc 
actccccggt 
ctttcctctt 
aaagaataag 



60 

120 

180 

240 

300 

360 

420 

480 

540 

588 



<210> 194 
<211> 640 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> mis cofeature 
<223> Incyte ID No: 



LG:1040978 .1:2000FEB18 



<400> 194 

ctcctaccta cgcagcagag gccagccacc tccccgtcgc tcfccacgctc ttcgtgcocg 60 
ccgcctctcc tccgtctctg cctctctccc tctcccctcc cgtctgacgc acatggcctc 120 
catcgccgga tcttcggccc tctccttcgc caggcccgtc aaggcaatca acacaaattc 180 
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tctagctttc 
gagatctgtt 
gaagcagctt 
cggtgcagat 
tagcgtggag 
cgacaagctt 
ctaggtctat 
gttgttgcta 



tccccggcga 
tcctgcgctg 
gctcttcctg 
tcactggaca 
gagtccagcg 
gttgctggca 
cccttgatct 
tccccgtgtc 



ggaagggcaa 
ccaagaagga 
accatactga 
cggttgagat 
cacaaacaat 
aagcgtagtt 
aggtttctat 
gtaataatac 



cacattcctc 
tacaacagac 
agtttgcggt 
tgtgatgagt 
cgccacagtt 
ctgtacctcc 
ggtctgaaga 
gagatgtttt 



cgcctgcaac 
aaggtttgtg 
gaatcgaaat 
ctcgaggagc 
gaagacgccg 
tctatcgttg 
tttgtgattt 



caatgcctat 240 
agattgtgaa 300 
tctctgaact 360 
acttcgatat 420 
cggacctcat 480 
tcctaccact. 540 
tctccgcact 600 
640 



<210> 195 
<211> 380 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> Incyte ID No: 



LG: 44 6649 . 1 : 2000FEB18 



<400> 195 

gctgaaagcg 

tctcaggctg 

tgatgaagag 

cacagatcgg 

gctgaaagga 

aaagaagaaa 

tacctgtgag 



cgggagcaag 
attttgagaa 
atgctgttca 
ccggggctgt 
acttccaagg 
tatggaatat 
gacataatgc 



ctcagaagag 
aggcgggtga 
tctacagcca 
tggacctcaa 
aaaatgccat 
taacgaccag 



ggaagcaatc 
ggaggtgaag 
cttcaaacaa 
gggcaaagcc 
gaagacctat 
atttggtggg 



caggtcacct 
cgcctcaaga 
gctactgtgg 
aagtgggact 
gtggaaaagg 
cagccacatg 



cgccagtatg 60 
ctcagccaac 120 
gcgatgtaaa 180 
cgtggaacaa 240 
tagaagagct 300 
tgtgacataa 3 60 
380 



<210> 196 

<211> 3854 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc__feature 
<223> Incyte ID No: 



LG: 132147 .3 :2000PEB18 



<400> 196 

ccgcgtcctc 

cggttgcagc 

tcgccgtcgc 

ccggaagccg 

tggtgaaaaa 

ctgggcaatg 

attatgacct 

agatccataa 

tcaaccgcct 

acaatgatac 

caggaggaca 

aagaaaggag 

aatgggagcg 

gc tttgttga 

cagatcccag 

gaacacattt 

aaggccaggt 

tgcccaggga 

agatccgtaa 

aatttacaga 

tgaaagacca 

cagtcccaag 

cccaacaaca 

aggaatcata 

taaaatttcg 

tcttgtcaca 

tttatacatt 

actttgttgg 

tcacattgcc 

tagtagatcc 



cgcggctcag 
gccaaaagct 
ccctgtcccc 
gaggaacggg 
ggattttttc 
ccattctaca 
gtatattgga 
gaaacaaggt 
caaagacact 
agttagagga 
agttgtggac 
aaccgcctct 
cccaacacga 
tgagaacact 
gctggcagag 
acatactcct 
gtatttctta 
tcttagcaac 
tacggcaaca 
tcctcggctg 
acagcaacag 
gtacaagcga 
gcctcaggca 
tcgacaggtc 
tggagaagaa 
tgaaatgttg 
gcagatcaat 
acgaataatg 
tttttataag 
ggatcttcac 



gacgactacg 
cgggcctcag 
gtcctgcccg 
cccgtcaagc 
cgacttcctg 
gatactgtga 
aagtctgatt 
gctggatttc 
ggttatcaga 
cagatagtag 
tgcagtcgtt 
ggaagaatcc 
ccggcatccg 
ccaattagtg 
aggagagtca 
ccagacctac 
catacacaga 
atcaattgtg 
ggcagagttt 
tctgctaact 
caagtggtat 
gacctggttc 
ggtcattgcc 
atgaaaatga 
ggccttgact 
aatccatact 
cctgattctg 
ggaatggctg 
caattgcttg 
aacagtttag 



ctcggcacgt 
cccctgacgc 
ccgcctccgc 
tgcacctgac 
atccatttgc 
agaatacgct 
cagttacgat 
tcggttgtgt 
ggttggattt 
taagtcttca 
tatttgataa 
agtatctaaa 
aatattctag 
gaacaaatgg 
ggtcacaacg 
cagaaggcta 
ctggtgtgag 
aagagcttgg 
atttcgttga 
tgcatttagt 
cgttatgtcc 
agaaactaaa 
gcattgaggt 
gaccaaaaga 
atggaggcgt 
atggcctctt 
cagttaatcc 
tgtttcatgg 
ggaagtcaat 
tgtggatact 



ccgggaccct 
tgtggtgact 
cgctggggac 
agtactctgt 
taaggtggtg 
tgatccaaag 
cagtgtatgg 
tcgtcttctt 
atgcaaactc 
gtccagagac 
cgatttacca 
ccatataaca 
ccctggcaga 
tgcaacatgt 
acatagaaat 
tgaacagagg 
cacatggcat 
tccattgcct 
ccataacaac 
tttaaatcgg 
tgatgacaca 
aattttgcgg 
ttccagggaa 
tctctggaag 
tgccagggaa 
ccagtattca 
ggaacattta 
acattatatt 
taccttggat 
tgagaatgat 



ccggccgtgg 
ccgccgcgcc 
atgtccaacc 
gcaaaaaacc 
gttgatggat 
tggaatcagc 
aatcacaaga 
tccaatgcca 
gggccaaatg 
cgaataggca 
gacggctggg 
agaactacgc 
cctcttagct 
ggacagtctt 
tacatgagca 
acaacgcaac 
gatccaagag 
cctggatggg 
agaacaacac 
cagaaccaat 
gaatgcctga 
caagaacttt 
gagatttttg 
cgattaatga 
tggttgtatc 
agagatgata 
tcctatttcc 
gatggtggtt 
gacatggagt 
attacaggtg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 
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ttttggacca taccttctgt gttgaacata atgcatatgg tgaaattatt cagcatgaac 1860 
ttaaaccaaa tggcaaaagt atccctgtta atgaagaaaa ..taaaaaagaa tatgtcaggc 1920 
tctatgtgaa ctggagattt ttacgaggca ttgaggctca attcttggct ctgcagaaag 1980 
gatttaatga agtaattcca caacatctgc 'tgaagacatt tgatgagaag gagttagagc 2040 
tcattatttg tggacttgga aagatagatg ttaatgactg gaaggtaaac acccggttaa 2100 
aacact'gtac accagacagc aacattgtca aatggttctg gaaagctgtg gagttttttg 2160 
atgaagagcg acgagcaaga ttgcttcagt ttgtgacagg atcctctcga gtgcctctgc 2220 
agggcttcaa agcattgcaa ggtgctgcag gcccgagact ctttaccata caccagattg 2280 
atgcctgcac taacaacctg ccgaaagccc acacttgctt caatcgaata gacattccac 2340 
cctatgaaag ctatgaaaag ctatatgaaa agctgctaac agccattgaa gaaacatgtg 2400 
gatttgctgt ggaatgacaa gcttcaagga tttacccagg actctattta tacaacccca 2460 
actgacagcc tcctttcagc agagtttcaa agaatatgct gaaatacagg aaaacactcc 2520 
cccccccccc cttttttttt tttttttaca ttttaggaca ctgtgagggg aaaggacaat 2580 
tttgaaattc cttttcaagg aaaaaaaagg tctttatgct ttgccatgag gccacattca 2640 
gctgctattt aaacttaata tcttgaacct aaagaatgct gacttttcct acatttccag 2700 
agttaggcag tattctacac ttaaagacta" ctactatttt tataaaaggt aatctattca 2760 
aatttcttca cagatttcaa gtctctcaaa catcaagaca acttcagcag tcggtacaag 2820 
tcacatttca ttttgattga atacatgatc ttgaacagct cctgtacttg ctctttgtaa 2880 
aaaaaaataa aattattttg aattattcta cctttgtaaa caattggcta aaagaatcat 2940 
ctttaagaaa ttaagccatt tacatgtttg tgtttttcta tagcagagca ttatattttg 3000 
cattatatgt ttcaacctag tctaagtggg tcttttttac atttttcaag aacggatttc 3060 
ctggaataca gcgatataat tttggttgtc aaattcctaa tgcaaccatt tagtctaaac 3120 
ttagtcattt atttgtgaca ataagatgtg ttcaggggct ccctgttttt aagagactct 3180 
ttaaaaaaaa aaaaacctaa tgtttttatc ttgagtcaat atgattaggt attttggatt 3240 
tacttttaat cttaaaatac tgcattttta tagcttctca gagcatgtgg atgggatggg 3300 
attttcgtta ttttgctggg tcagcttatc tttaatatat ggactattcc tataaaccaa 3360 
agtctctgac aagtgcacct aatttatatt gtattttaac tacagtgtaa gtttccatta 3420 
acaaaaccat cctaaagcgt taactgctca taattttaat cagctacagt tatgaaaaag 3480 
gaagaatttt gctctaaaga ttattaataa gcttagaaaa tcctgctttt acctaacaga 3540 
atgaagtggg gttgaggggc ggtagtatga acagtgattc taactctatg gcgtaaatta 3600 
ctttgaagag ttcattttgg aagtaaggga atcccgcttc ccttggtgtt gaagatctgc 3660 
atctgatttt gaaacgtcca gatttagctt tcaaggtgcg tgcatgttga atgatccaga 372 0 
agcaacgaaa actctggggt gagggattga aaaaatctga aatcacagtt ggatttttaa 3780 
attcaaccat caaaaatatg tactatcagt ttttttctac attgatgtga aataagcaat 3840 
tttcatgata atta 3854 

<210> 197 
<211> 905 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LI : 036034 . 1 : 2000FEB01 
<400> 197 

gggaggatga cctcttaccg 
gagccccagc gacacccgag 
cgaagaaatc cagaggatgt 
ctctctgcca ttttcttagt 
tgaaccattc atctgaaaac 
ctacacaaag aactatggca 
ccatttgcta aaggaatagt 
caatatgctg aagacctgct 
cagcttgaag actttttcaa 
gctgaagctc actcagattc 
tccagtgaca gtgatctggt 
gatggaatgg cttccaatca 
gattcttcag cactaggggc 
cgggaaagtc gtagcctctt 
ttggagaaag caggagaatt 
gaagc 

<210> 198 
<211> 585 
<212> DNA 
<213> Homo sapiens 
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ggagcggagt gccgacctgg cccgtttcta cactgtcacc 60 
gggctacaca gtatataagg tcaccgcccg ggttgtttca 120 
ccaggaggga gaatcctgaa ccaacctatc catgaacata 180 
ccttttttcg ggaaaactct tttataataa caactgttgg 240 
tcttataatt gtatggaaga gatacagtga ttttaagaaa 300 
aattcacaaa aacttattcc gacattcaga gttgtttcct 360 
gtttgggcga tttgatgaaa ctgttatcga agagagaaga 420 
acagttctct gccaatattc ctgctcttta caatagtaaa 480 
gggtggaata attaatgata gttctgaatt aattggtcct 540 
cctcattgat acctttcctg agtgtagtac ggaaggcttc 600 
ate tct tact gttgatgtgg attctcttgc tgagttagat 660 
aaattctccc attagaactt ttggtctcaa tctttcttcg 720 
tgttgcttct gacagtgaac agagcaaaac agaagaagaa 780 
tcctggcagt ttaaagcega agcttggcaa gagagattat 840 
aataaagctg gctttaaaaa aggaagaaga agacgactat 900 

905 



WO 01/62927 



PCT/US01/06059 



<220> 

<221> misc_ feature 

<223> Incyte ID No: LG: 162161 . 1 : 2000MAY19 
<400> 198 

gagtcttcgg cgagacctac cgggccctgt ctgtgccggc cgtgagctgc ggcaggacgg 60 
ctggagatta cttctctaac ' aggatcacca gtttgctcca caaagcacaa tgtctcgatc 120 
acgacaaccc' ccccttgtga ccggcatctc tccaaatgaa gggataccat ggacgaaggt 180 
cacaatcagg ggagaaaatc tggggactgg ccccaccgac ctcataggct tgaccatttg 240 
tggacataat tgcctcctga cggcagaatg gatgtctgca agtaaaatag tatgtcgagt 300 
gggacaagcc aaaaatgaca aaggagacat tattgtcacc actaagtcag gtggcagagg 360 
aacctcaaca gtctctttca agctactcaa acctgagaaa ataggcattt tggatcagtc 420 
tgctgtgtgg gttgatgaaa tgaattatta tgatatgcgc actgacagga acaaaggaat 480 
tccgcccttg tccttacgtc ctgctaaccc gcttggcatg gagattgaac caagtacatt 540 
ttcgcagaag gacttagaaa tgctattcca tggaatgagt gctga , 585 

<210> 199 
<211> 518 
<212> DNA 

<213> Homo sapiens ' 
<220> 

<221> misc_feature . 

<223> Incyte ID No: LG : 407214 . 10 :2000MAY19 
<400> 199 

gaaaagagtg ctccaccttc actgaagcca gcaatttggc cggctgttag cacttcattc 60 
atattcccta gtatgagtta attctttgtc attttatttc ttgtgattcc agaatctcat 120 
tttcagcttc tgccagagcc agtagcagtt gggaaagaag gctgtgctct ttgaagagag 180 
tcagccagcc tgaaagagca ggatggatct tgatgtggtt aacatgtttg tgattgcggg 240 
cggcacgctg gccatcccaa tcctggcatt tgtggcttca tttcttctgt ggccttcagc 300 
actgataaga ate tat tat t ggtactggcg gaggacattg ggcatgcaag teegctatgt 360 
tcaccatgaa gactatcagt tctgttattc cttccggggc aggectggge acaaaccctc 420 
catcctcatg ctccacggat tctctggcca caaggatatg tggctcagtg tggtcaaagt 480 
tccttccaaa gaacctgcac ttggtctgcg tggacatg 518 

<210> 200 

<211> 1233 < ; 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG:204626. 1-.2000MAY19 
<400> 200 

gcagaaccag attctccagg tgtgaagtaa gcaatctatt ttcacttget attaacagca 60 

aacgtacaac tttatctctt atttgacttc attatttggc tttcaaatta acaacatgaa 120 

aactatacag aaattagtct gtgecctata agacagagat ttagataatt tgttcaaaac 180 

atgagatgat attcagatat tctctcaaag gcaggactcc aacaccgcag gtggaaaacc 240 

aacaaaagga agaaaacaga aaggtggggg agataaggag tagecaggaa aaagaaaata 300 

gtgataaagg aagagccaaa aaggaggaaa gggaagatga aagaccgact tcaagaacta 360 

aagcagagaa caaaggaaat tgaactctct agagacagtc atgtatcaac tacagaaaca 420 

gaggaacaag gggtgtttct acagcaagct gttatttatg aaagagagee tgtagctgag 480 

agacacctac atgaaatcca aaaactacag gaaagtatta acaatttggc agataatgtt 540 

caaaaatttg ggcagcaaca gaaaagtctg gtggcttcaa tgagaaggtt tagtctactt 600 

aagagagagt ctaccattac aaaggagata aaaattcagg cagaatacat caacagaagt 660 

ttgaatgatt tagttaaaga agttaaaaag tcagaggttg aaaatggtcc atcttcagtg 720 

gtcacaagga tacttaaatc teagcatget gcaatgttcc gecattttea gcaaatcatg 780 

tttatataca atgacacaat agcagcaaag caagagaagt gcaagacatt tattttacgt 840 

cagcttgaag ttgctggaaa agagatgtct gaagaagatg taaatgatat gcttcatcaa 900 

ggaaaatggg aagtttttaa tgaaagctta cttacagaaa tcaatatcac taaagcacaa 960 

ctttcagaga ttgaacagag acacaaggaa cttgttaatt tggagaacca aataaaggat 1020 

ttaagggatc ttttcattca gatatctctt ttagtagagg aacaaggaga gagcatcaac 1080 

aatattgaaa tgacagtgaa tagtacaaaa gagtatgtta acaatactaa agagaaattt 1140 

ggactagctg taaaatacaa aaaaagaaat ecttgeagag tactgtgttg ttggtgctgt 1200 

ccatgctgta gctcaaaata aagaagctat ttt 1233 
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<210> 201 
<211> 1002 
<212> DNA 

<213> Homo sapiens 

<220> ' 

<221> misc_f eature 

<223> Incyte ID. No: LI : 007401 . 1 : 2000MAY01 



<400> 201 

gcggccgctt 

cagtctcctc 

ctcgcccggg 

ggcccccctc 

ctatgatgcc 

agttgggctt 

ccctgtacaa 

gagtcagaaa 

tgatcaagct 

gttagccata 

tgaagagact 

cagtagatga 

aaactgaaac 

tagagcctgt 

cccctgccaa 

agcctgttcc 

cttctcgacc 



tgccggtcat 
gctacagatc 
tggaggtcga 
acccgcgggt 
cctgaagatg 
tcaacattca 
gaattgaaac 
gtactacagc 
actgccagtc 
gatcaagtac 
gaattagaat 
aaccacgaag 
agaagtattg 
gtcctcagac 
acccccaaga 
agcacgccca 
tgctccacca 



tcccgaagcc 
gtctgctccc 
cgaggaggag 
gtgtcctata 
tgcacctagg 
aggaaacaga 
aagatgtgtc 
ttgaagatga 
ctattgtggc 
taccggaaga 
ttaaaaaaat 
ttaactcaaa 
aacaaggaag 
tccttatcta 
caccttactc 
cctcctccaa 



cggcaactga 
tcagcctcgc 
acaagagtca 
aatggcgtcg 
gggcggctac 
gaacactgca 
taacaagatt 
ctctttggat 
tagaacagat 
atcccaaaag 
gctttccttc 
caagttcaac 
cagtggaagt 
ctaaagattt 
cagagcctga 
ctaatttccc 
ccaagaaaaa ggaaaagcga 



gggcggcccc 
ccgagaccca 
cccttcctcc 
gaaagcgaca 
cccgtggggt 
tacaaagttg 
atcgaaagta 
tccacaggaa 
ctgagcaata 
gcagagagtc 
tgatgaaacc 
tgagcagctt 
caaaggaggt 
tgccgctgtg 
tatagtggct 
acctcctaga 
at 



ctttccttaa 
cttccccagt 
aggcggcgcc 
ccgaggaatt 
ctccaggaaa 
gaaatgagtc 
ttatcgagga 
aagaactctc 
tacccggact 
agaatacatt 
tgtgagaaac 
aatgtgcttg 
ggtgatgttt 
gaagaagtgg 
agtacaaaga 
cccccacctc 



<210> 202 
<211> 444 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> mis cofeature 
<223> Incyte ID 



LI:476342. 1.-2000MAY01 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

10O2 > 



<400> 202 

ggacggacgc 

cgatggagct 

tctcacacta 

gcaagagcca 

tcaaggagga 

aggcgtgcta 

actaagtaac 

gtgtcatgta 



gcgtctccgg 
catcaagtcc 
cgacggcggg 
gcaccactcg 
cggcgggtgg 
agcaaagctc 
acccttggct 
acgtgacagc 



tcccgtccgt 
agggcgaccg 
acgacgacga 
ttcccctgca 
accgccggct 
ttcaaacacc 
agctgtgcac 
aata 



aataacactg 
tgtgcgcgct 
cgatggtggc 
tctccgaccg 
actgccacct 
cttggcttgc 
aacctacgta 



atccgatcca 
cctcctggcg 
ggaggcccgg 
cctctgcagc 
ccgctactgc 
cagaactgaa 
ccgtgcatgc 



cgccggccgg 60 
ctgctcctgc 120 
gtgtgcatgg 180 
aacgagtgcg 240 
aggtgccaga 3 00 
ctctagtagt 3 60 
atgtaatgtg 420 
444 



<210> 203 
<211> 1956 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 
<223> Incyte ID No: 



LI : 1072759 . 1 : 2 000MAY01 



<400> 203 

tctagatcgc 

gtctgtgaac 

acacttggtc 

catctgattc 

aggaggctgc 

tgaaaggcta 

gaaccagcaa 

caacacaaca 

gctggtgttg 

atgcccttct 



gagcgggccg 
actaccccct 
cgtcacttgt 
taataaatgc 
tgagcatggg 
gagcgtgaac 
gtactatgtg 
tgcattacag 
gtgaatttga 
ggccgtaccc 



cttttttttt 
aaaagtccaa 
gacacggttg 
ggtggcatcg 
aaagggctcc 
gtggtatcac 
actatcattg 
ggtacatctc 
cagctggtat 
actggggtgt 



cgcaacgggt 
taactgggaa 
aatccgggca 
aacaaaagaa 
ttcaagtatg 
cattgatatc 
actgccccag 
aggctgactt 
ctccaagcaa 
gaaaccaact 



ttgccgccag 
caggaaactg 
aggtcccacc 
ccattgaaaa 
ccttgggtct 
tccttgtggg 
cgacacagca 
gtgctgtcct 
tggggcggaa 
aatagctcgg 



caacacaggt 60 
actcatatta 120 
aattatgggc 180 
atttgacgac 240 
ctggataaac 3 00 
aaccttctga 360 
gacctttact 420 
gattgttgct 480 
cc c ggagagc 540 
tgttaacaaa 600 
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agtggattcc actgagtcaa 
aagtcagcga ggtacattaa 
ccaatttctg gttggaatgg 
ttcaaggaga tggaaagtca 
ttgcaggcta ctagactgca 
ccgtgcactc tccaggatgt 
gcgatgtgga gactggtgtt 
tcagacgtta acatctcgaa 
aagctcttcc tgggggacaa 
gttcgtcgtg gcaacgttgc 
gtggcttcac atgctcaggt 
ctatgctctc tgtattggat 
tctgaaggaa aagattgatc 
aacgtgaaga tctggtgatg 



gctgttgaga 
agactagaca 
gtagctggtc 
ttatccccta 
aacctgtttg 
taacaatgca 
ttttctattt 
aatggaaaca 



gcttctctag 
gttgcggtgg 
ataggtctac 
atacctgcct 
ttttcaattg 
tcgtaaaccc 
ttttgcgtgt 
acttgaccaa 



taaatgcaga aaaaaaaaca 

<210> 204 

<211> 1726 

<212> DNA 

<213> Homo sapiens 



cctaaaggcc 

gaaaaatggg 

tgacaacatt 

cccgtaaggg 

tcctaccacc 

ctacaaaatt 

ctcataaccc 

gtaaacatct 

tgtgggcttc 

tggtgacagc 

gattattcct 

tgcctactct 

gccgttctgg 

ctgcgcatgt 

actatccacc 

gtgtctatct 

ccaagtctgc 

acccctactc 

gccaatttaa 

cttcagaagg 

ggcagtttca 

aaattgtgtc 

aaaaggtggc 



cgaagaagat 
cttacaaccc 
ctggaagcaa 
atggcaatgc 
aaactcgtac 
ggtggtagtt 
gcgtatcggt 
gtcgaaatgg 
aatgtcaacg 
aaaaattgac 
tgaaaccatc 
ggctcactat 
tataaaagtc 
gttgatatgg 
tttggggtcg 
aatagtctag 
ccagaaagct 
ttaatcagtg 
agttaagtag 
aacaggagaa 
agttattagt 
acagaaattt 
gggccg 



PCT/US01/06059 



atgaggcaac 
cgacacagta 
ggtgctatca 
.caggtgagaa 
aactgacaag 
ggtactgtgt 
aggtcacctg 
gccctgaaag 
aatgtgtctg 
ccaccaatgg 
caggccaaat 
tgcatgcagg 
tggaagtatg 
gttcctggca 
ctttgctgtt 
tggactaaga 
cagaagggtt 
ggtgaaagaa 
taaaagactg 
tgttttgtgg 
ttttaaaatc 
tgagacccat 



ttgttcacgg 
gcctttggtg 
ttgccttggt 
ccacgcttgc 
gcccttgtgc 
cctgttgtgc 
ttgctccgag 
gcttggagtg 
tcaagggact 
gagcagacgt 
aatgcgccgg 
ttttgctgag 
gccctaggat 
aggcccatgt 
cgtgatatgt 
taggctgctg 
aattggaata 
cggttctcag 
gtttaatgaa 
accactttgg 
agtacttttt 
taaaaaaagt 



660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1956 



<220> 

, <221> misc_feature 
<223> Incyte ID No: 



LG: 998857 . 1 : 2000FEB18 



<400> 204 

ctccatgcca 

gatgaggggt 

atttgtgccc 

gacgtcagtg 

aggtgcaggg 

caggaggtca 

cccggggttc 

ggtcatgggg 

gaaccaaccg 

gggcaccaac 

ctcggagtgg 

gccaaaggag 

cgtgcggggc 

ggtgccgtag 

cgggggcatg 

ggggcccagc 

cttccacagc 

ggtgttggaa 

ttcagccgcc 

actctccttc 

catgcacttc 

gtctttgatg 

atggaatgtc 

ctgcaccaag 

cttctggatg 

gtagttgttg 

cttgtagatg 

gtcccgggtc 

aggtagtccc 

<210> 205 
<211> 1595 
<212> DNA 



ctctgacacc 
gctgagcggt 
ctcgggaaga 
gcaactgctg 
ctgggggcac 
tcgaggctgt 
atgggtgtca 
gtcatgggat 
ctcgcggacg 
acctccacca 
gagaccaggg 
ttggagcgcg 
agggagcgac 
gagccggagc 
tccaggcagg. 
aggccggggg 
agcacgcggt 
agtagcaggt 
cgctgactct 
atctcccgca 
tccaggttgg 
aactgcatgt 
tgcaccacca 
atctccccca 
gcactgaagt 
cccaggtaca 
gccatggtgg 
tgagcaggag 
tcaggtgccc 



ctggcagtgc 
cattggtgat 
gctcctgtgg 
tgctgcccat 
ggcttggcac 
ggctgccccg 
tgggtgtcat 
tcacgggccc 
agccctccag 
cgtccccagc 
cgcggactct 
agctttgggc 
ggtctggctc 
cgtgccggct 
tgggcgtcag 
agtgggcgcg 
tctggagcat 
gcttctctgc 
cagacacgaa 
cgttcttgtc 
ccgctcggtg 
ccagcttggt 
cctccaggtc 
gaatctgtga 
agacctcggc 
ccaggttctc 
acctgtagaa 
gctgggagct 
acgactcagc 



actgtgggta 
ggtgggacga 
tgggtactgc 
gctgctgcgg 
ccggcttggg 
gagtgggtag 
gggggacatg 
ctcctccaga 
cttgccgtag 
ggagaagcgc 
cctggcgccc 
gctgccgctg 
tagctgggac 
gcggggggag 
gcggcccgag 
cgacgggctg 
cccccgggcc 
taggaagcga 
ggcctgcatc 
tctcttgcgc 
gcggtactcg 
gttcttctcc 
agagttcaag 
ggtggggctc 
cgcctcggac 
cagggcgggg 
ctggtccatc 
ggtggcgatg 
tggcagcgag 



cacctcgacc 
agcttgacag 
tctgaggaca 
cggctgctgg 
gtggagttgc 
gacctggaag 
gaggtcatgg 
gccttcacgt 
agccagccgt 
agcagcgtgt 
ccgccgccgc 
tagagcgagg 

gcgggcctcg 

ctgaactctc 
gggtagggcg 
cggctggcct 
cggccgaaga 
tagcgccgct 
tgtgcgtgca 
tccatgcgcc 
agctcatagt 
atgtgctgca 
tgccgctggg 
tgcagggcac 
agagcgtgga 
ttaaactgct 
tcgggggcca 
gcacagccgg 
gaagcc 



ccgcctcacg 
tggcaaaagg 
tcagtttctt 
gcaagggtgg 
ccggccggtc 
gcagctcgtt 
aggtcatggg 
acgcctcggg 
tctgggcctc 
ggttggcgcc 
ccgggcgctc 
aggccgacgg 
cgtcgggctc 
ccaggggcct 
gccccagcgc 
cagactgctc 
actgcaggaa 
tctcttccaa 
gccggttcac 
acagctcaga 
gctggcggct 
gcaggcctcc 
tgtcagacat 
gctccccaat 
aggcacgcag 
ccatgatgct 
tggaggggct 
gagcagtggt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1726 



100/228 



BNSDOCID: <WO 01 62927A2_I_> 



I 



WO 01/62927 



PCT/US01/06059 



<213> Homo sapiens 
<220> 

<221> misc_f eature { 

<223> Incyte ID No: LG:482261 . 1 :2000FEB18 



<400> 205 

ctccacgttg 

tctttatttc 

tatcaggcag 

gttacacctg 

ggtcggcctt 

aaaagtaaac 

aaggctagca 

tttcctgaac 

caagtttgta 

aaagcagcct 

tctgcagagg 

gaaaatacag 

gctgctccta 

aggagcagtc 

gcttggagcc 

ggcctcagga 

ctaatgttaa 

ttgatgcctg 

aagcggtatg 

agtatggtcc 

tattgactcc 

ttcagtttaa 

taatcctgct 

tgtaaacaac 

agggcatggg 

caaggctcaa 

gctctcactg 



cgatggatcc 
ttttctcatt 
gttcccctac 
agcgtggtcg 
gcggtaagct 
attctgcata 
cagaaaattt 
atggtaccat 
aggaaggaaa 
cggaaccgtt 
aaggtggtga 
acaaacctgg 
aaatagagaa 
ggctcctcat 
ttggaggtct 
accacaagcg 
tgaatcgcca 
gcagtttgca 
ggaaccattt 
caattcgcct 
gtgtgattgg 
gacttggtgg 
attgccatta 
catcaggact 
agaaaattca 
aggaaccata 
atgaaagtgc 



ttggacccac 
ccctcgtctc 
atctttggca 
ttgtgaagaa 
tgtgcactcg 
tttacat.ttt 
gattactctg 
ggacttcaaa 
atttatcccc 
tcaatcggaa 
tgctgctgaa 
agatgagtta 
gccatatatg 
tgggatcatc 
gaatctcaaa 
catggtgtag 
ttacagttta 
gtagttttgc 
tcttttaagc 
ttcgtccgat 
gagattttaa 
accgacgagg 
catttgaaca 
tcgagatgat 
ggatctggga 
tcctgatttc 
caggaaggcc 



ttttgttaac 
caccgggaag 
cccaacacgg 
cggtctgtcc 
gagcattcca 
attaagctcc 
tttccaacag 
gattgggaac 
ctaacagcct 
aatgaggctt 
ggaggagagg 
atttcttttg 
ccaagatgtt 
cggagtggcc 
tttgccctgt 
caccagtaca 
ttattcggca 
aacccccacg 
tgctcaaaga 
cgctattgca 
cgaaagttac 
ctcaaaatca 
acttctagga 
gccattgaac 
actacttatc 
attgctcgcc 
gggtg 



tcttaaactt 

cjggagagcct 

tctcctcgaa 

aggaactccc 

gggacaccat 

tcttaaagag 

tagagcaata 

aggtgggaat 

ggtcaaactg 

atcctccagc 

atagtgaaga 

aggagcacgt 

taaaacaaag 

gcctccaata 

ttcacgaatg 

acataaggct 

ggctagatta 

acagcaaggt 

tcttaaagca 

atctgtggct 

actttcgccc 

gatcgaaaaa. 

atagggggtc 

aaattcgcaa 

agtcttttaa 

ttcaagacgc 



tgtgtctttg 
gcgggtggtg 
cccaggtgaa 
gagaacgtgt 
gggacaatcc 
ggcaggaatt 
ttgtccttgg 
tgccttaaaa 
ggctatagtt 
agaaagaatt 
agattttgag 
gggaccttca 
aagggccttg 
aagcaatgta 
aatgaattgt 
gcactgccat 
gccggagatc 
ggagcccatc 
gctgttggtc 
cagaataagc 
tcccaatttc 
accgtgctgc 
aatggggaac 
ttgctgtttg 
ttctattaga 
agcacagaag 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1595 



<210> 206 

<211> 2006 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 
<223> Incyte ID No: 



LG: 480328 . 1:2000FEB18 



<400> 206 

gccacaagcc 

gaatgtgggg 

agtgataaac 

gttagacatg 

gcttttaata 

tttaaatgta 

agaattcata 

tcaatggatt 

cagttcagtt 

tatatgcagt 

tttttttaat 

ttcatagtag 

atcttccagc 

tgttgatata 

atcaacatct 

atcatatata 

agatgtggag 

ccaaagtgtt 

gtgaactctg 

cgtggagtac 

aaaccaatta 

aaagatttac 



tcatatatca 
aaacttttag 
cctatgaatg 
aaagcgttca 
gtagccatga 
acaaatgcag 
tttgatagag 
tatcccactg 
tttattaggc 
ttctttggta 
aacataagaa 
aggaaaactg 
aacaatcatt 
agcgatgatg 
gcttcctata 
aatgatgata 
gtcagagtgg 
tgagtccatc 
ccttatgtaa 
agaacattgt 
tctttgactg 
agaatgtcaa 



tgaccgaact 
tcatgcttca 
taaaagatgt 
tgctgatgga 
acttagtata 
aaggtccttt 
aaattgtggc 
aacaatttga 
atcaatttgt 
tgacctgtgt 
aacttttctg 
agaaaacgta 
tattgttaaa 
ggaaaggcta 
gctgggatga 
gtagtaccta 
aagcaggtgt 
ggcaagtttg 
tgtggatgct 
ggtaggggaa 
ctgttctcaa 
catcacactg 



catgctggtg 
catcttatta 
gggaaggcat 
aatccctata 
catcatagag 
aggcttagat 
atatacaggt 
atgaagacaa 
attggtgtga 
taactccaga 
atattccbct 
ggaaaccatt 
cctggaaaaa 
tttagccact 
tatgggggaa 
cagtatagtg 
gagagggtcc 
ggcctggcct 
gggcacagag 
gggactcact 
ccacgtagtg 
tgoatcctct 



aaaagtccta 
ttcatgagag 
ttcactgtgc 
tgtgtgaaga 
ttcatactgg 
ccatccttga 
agtaaagaaa 
tgtggaatgg 
acttttcaaa 
ttcagagaat 
agttttggat 
ctttatactt 
gccattacac 
ctgcatattt 
gatatttatc 
ctgttagaat 
cgcagaagaa 
tagctgttgc 
ctgccatctt 
ttctcatccc 
cgccagtcat 
tccggcccat 



tgaatgtaaa 
aattcatacc 
ctcatatctt 
gtgtgggaaa 
tgagaaaccc 
agtacatcag 
gcacattgtg 
ctttgagata 
atgtaaatca 
ttataatgta 
tgccaaaata 
gcatgtaaac 
aggaaacctc 
aatgctcatg 
ccttctgggg 
tacatgagtt 
aacatggctg 

tgaccaattc 
atgtgtacaa 
cactggtagc 
cactagccag 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 



101/228 



BNSDOCID: <WO 01 62927A2_I_> 



WO 01/62927 



tdaccc 



cttcctcgca tcttcSecag 
accacggaga tcctcaagtc 
agggagctgg tctccagcca 
ctcattctgg acgacgtgtc 
tggcccccca ggaagcagag 
ctgagggtga ctccaaggca 
gccagagcga, gctgtgcaag 
ctcggaacat cacctgcctg 
gccccagcct acctgcacct 
ccacctttcg ccctcacccc 
agtaaaggta aaatcacttc 
aaaaaattat gggaaactac 

<210> 207 
<211> 984 
<212> DNA 
<213> Homo sapiens 



<220> 

<221> misc_f eature 

<223> Incyte ID No: LG: 311197 . 1 :2000MAYi9 



cattggagag 
actggtggct 
ggtgagcaac 
tttgacacat 
agcgccagat 
, get gage tga 
ctggaagctg 
ccggcagggc 
ccgtgaggca 
agaaatcact 
agatctctaa 
atgaaa 



gactacgatg 
cgctttgatg 
aaccttatgg 
ctgaccttct 
ttgtggtgga 
tcgccaactc 
cagaagacat 
agtccgtgct 
actgggccac 
gaaatttcat 
aaaacaaaga 



PCT/US01/06059 



agtg^tgct 
ctggagaact 
agt gage age 
tgaaggagtt 
aaaggeggee 
actggccact 
tgcataccag 
cctccagctg 
agccccgatg 
gattggctta 
attacatgag 



geegttcatt 
aatcacccag 
cacctttggg 
aacagacagg 
atcatctctg 
gcaggggacg 
ctctcatgct 
ccctagtgag 
attegtaaca 
aagtgaagga 
ttagtacatg 



1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2006 



<400> 207 

gctctccacg 

gecaaatttg 

atgaaaaaca 

ttcccacatt 

gctgtcatcc 

caatgaagag 

agtgatgctc 

ttggtgggaa 

ggtccaggca 

gecgaatttg 

cagtgatatc 

gtctggtgac 

tgaagatatt 

tttatacagt 

caccatgact 

cagccaaccc 

tgatacagee 



tgcgcgactg 
atatctggag 
aaacaaaaaa 
ccaatctatg 
actgacttta 
cagcagaaag 
aagttagtag 
cagaagcatt 
gatgeaaage 
aagatggtga 
tgeaaaatec 
tattttaaga 
ctaaacctgg 
aaaaccatga 
tggt'tcagtg 
ccccagtccc 
aagctcaatg 



cgaggctgga 
cacctggaac 
gaaattttgc 
tetgectttg 
catttgette 
acgtcacact 
aacagatcaa 
gctggcttct 
ttctcttcac 
ggttgcgagt 
tgaatattag 
agaagaagaa 
agagttctcc 
cccctatata 
acagcccttt 
cagaagcact 
cagg 



cgctacgggc 
ettgeagagt 
tetgeaagtc 
tgttcagcag 
ctgggagctt 
gagagtatct 
tatatcccaa 
gaaaacccac 
ccctcagcat 
cagcttctca 
aagatcagaa 
aaaagacaaa 
aacagcttca 
tgaccccatc 
gaeggaacaa 
tgcggatatg 



tcctggaaag 
gttttgaaga 
tactgacaag 
acaccagcat 
gtggtccgcg 
ggagaccttc 
gactggtcag 
tggaccctgg 
aaaatgctgc 
gctgtggttt 
gagctttcct 
aataataagg 
ggttcatcag 
aatggaacac 
aactgeagea 
taccagcctc 



gagttaagat 60 
ggaggttaca 120 
tcccgtttcc 180 
ttgccacaat 240 
ttgaccatcc 3 00 
acgttggagg 360 
actttgetet 420 
acaaatatgg 480 
gccttcgtct 540 
ttaaagctgt 600 
tgttaaagcc 660 
aacccataat 720 
taagtcctgg 780 
cagcatcatc 840 
tcctcgcatt 9 00 
ggtctctggt 9 60 
984 



<210> 208 
<211> 565 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 1054883 . 1 :2000MAY19 



<220> 

<221> unsure 
<222> 19 
<223> a, t, c, 



g, or other 



<400> 208 

geagagtgee 

tccacgcatg 

acttatacag 

tatcaggtca 

gaatgtatgg 

ageccgagaa 

gtttttgaaa 

gaagaaaaag 

agtaaaaagc 

aataaatatg 



cctgaagang 
gecgaagcat 
taagggcttc 
ccgaagtagg 
aaacattaaa 
atgaccgacc 
taagaactgg 
aaatccccaa 
tagaaatagg 
tttgtcatga 



ccgacagccc 
gaggaagctc 
aactaacctt 
acagcaaatt 
agaaacttgt 
tgatgcgtgt 
ecttttgeta 
gcaaataact 
atgtggttct 
gtcag 



ctgctccagt 
ategttagat 
actcaaaatg 
aaaacaatct 
ttgtataatg 
tataacccat 
ggtgatacaa 
ttaagatttg 
cttaactgag 



cacacccgga 
tcatttttct 
gggactgttc 
ttctgttcta 
ccactcagta 
ctgagcccgc 
gtaaaataat 
atgcttgtgc 
aaaggagcta 



agctgactgg 60 
taaattttgg 120 
ccagtgtatt 180 
tagttattat 240 
caaggtatgt 300 
tgcaaccacc 360 
aactagaaca 420 
agecattaat 480 
aagagtagaa 540 
565 



102/228 



BNSDOCIO. <WO 01 62927A2J_> 



WO 01/62927 



PCT/US01/06059 



<210> 209 
<211> 567 
<212> DNA 

<213> Homo sapiens 1 
<220> 

<22I> misc_feature 

<223> Incyte ID No : 1X3:399395 . 1 :2000MAY19 



<400> 209 

acagttcaca 

atatgatggc 

taaaagttgg 

ttcaaaaata 

gaggccaagg 

tgagaccctg 

tagtgtacat 

gttaatacat 

tgggtgggaa 

attaagacca 



cttactactg 
tgagtgagaa 
ccatatagta 
taggaatggg 
caggaagatc 
actcaaaaaa 
caacctggag 
gggaaaaggt 
agaggctgcc 
atatcaatat 



ttgtagttat 
tgatctagaa 
acatttttga 
tcgggtgtgg 
acttgagccc 
taaaaagaaa 
gaatattata 
atatgaaata 
atgctggctg 
cccactg 



aaattattca 
ctatgccagg 
tgtccaaatt 
tggctcatgc 
aggagttcaa 
agacaaaaat 
tagtctttta 
ggtggaacca 
aggtacctga 



cttgttttaa 
gattagataa 
ggagaggaat 
ctgtaatccc 
gaccagcctg 
acaagaatgg 
aaataacaaa 
agcagaactg 
gaacaatgaa 



ataagcacat 60 
aaacacagag 120 
ccctcaaatg 180 
agcactttgg 240 
ggcaacacag 300 
ctaagcaaac 3 60 
tatgtagatt 420 
agtcccaagg 480 
aacaagtcat 540 
567 



<210> 210 
<211> 971 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: LG : 380497 . 2 : 2000MAY19 



<400> 210 

tcgccccgtc 

tctccagccc 

tccgacatgt 

aacactgcct 

tggtctaaag 

caagtgaaaa 

gcccatcagg 

accacagatt 

atctccaaaa 

tccagaggag 

gcggatggca 

gacgcaggtg 

ggaaagttac 

ggcccaggag 

ctgctgggac 

tgagggccgg 

accagtagag 



cgctgcagtc 
gcccagtggc 
ctatgctggc 
ggagtaatga 
gaaagggttt 
ataaccacct 
atgattttaa 
cctcggacaa 
accgtgttca 
aacgaaacca 
gcactgaaga 
gaacgtaaaa 
ctccagccta 
cgagtggcca 
cagagttcca 
gacttcaccc 
a 



cgccggcgag 
cgcagtcacc 
tgaacgtcgg 
cgattccaag 
aggggctcag 
gggactcgga 
ccagcttctg 
gaaggaaaag 
ctatatgaaa 
cgacaaccag 
acaagcccca 

gggggaagaa 

aggccaagag 
agaagaagag 
aggcctctgc 
tgaaagccaa 



ggagttacgc 
caggtccaga 
cggaagcaga 
tttggccagc 
gagcaaggag 
gctaccatca 
gccgaactga 
aaatctttta 
ttcacaaaag 
cgccttcacc 
ggttccagtt 
aagaaataaa 
gcacacggag 
cgcgccagca 
tcaggatgca 
aaagaggaga 



acgtcctgat 
ggcggcggta 
agtgggctgt 
ggatgctaga 
ccacagatca 
ataatgaaga 
acacttgcca 
gccttgagga 
ggcgatgcca 
atccaggagt 
ccagggtctg 
gaggccacag 
ggaaagcccg 
gaagagcagc 

ggggaccatg 

ggcgagaaaa 



tctcctggag 
tcacaggctc 
ggatcctcag 
gaagatgggg 
tattaaagtt 
caactggatt 
tgggcaggaa 
aaagtccaaa 
gtccctccac 
actttgccaa 
acatttctga 
gtaaagatgt 
agagggccga 
tcagaggccc 
tgcagcogcc 
agctgcaaaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

971 



<210> 211 
<211> 486 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> Incyte ID No: 



LI : 272913 .22: 2000MAY01 



<400> 211 

agcgcgacag 

acgaccctgg 

gcgctacccg 

ttttattgta 

gaccctggtc 

cccctcaaag 

ctgccacaac 

atagacaacc 



cggcagcacc 
tgccctgggc 
agatatgcac 
aaacgacacg 
caaactttca 
gtgacttggc 
atgtcaactg 
aaatctgtca 



aagggaacgg 
tgccgccctg 
ccaatgtcca 
agagctaatg 
tcaggcacat 
caacaccttc 
ttctggagga 
agggcaaaag 



aaaatggcgc 
ctcctcgctc 
gggagcgtgc 
ctgcatgccc 
accactgtca 
cgtggcttta 
attaatgcct 
aacctttgca 



ctcacggocc 
tgggcgtgga 
aaaatttgtc 
gttgctgcct 
tcatagacct 
ctcagctcca 
ggaatactat 
ataacactgg 



gggtagtctt 60 
aagggctctg 120 
aaaagtggcc 180 
gaatcagaag 240 
gcaagcaaac 3 00 
gactctgata 3€0 
cacctcttat 420 
ggaoccagaa 480 
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atgtgt 486 

<210> 212 
<211> 115 
<212> PRT 
<213> Homo sapiens 

<220> ' 

<221> mis cofeature 

<223> Incyte ID'.No: LG: 1040582 . 1 .orf3 : 20 00FEB18 
<400> 212 



lie 


Leu 


Ala Thr 


Leu 


Ser 


Ser 


Pro Gin Pro 


Val 


Leu' He Ala 


Lys 


1 






5 






10 




Thr Gly Gin 


15 


Gly 


Thr 


Met Thr 


Ala 


Ser 


Ser 


Val Leu Leu 


His 


Lys 






20 






• 25 






30 


Met 


Pro 


Leu lie 


Gly 


Leu 


Gly 


Thr Trp Lys 


Ser 


Glu Pro Gly 


Gin 








35 




40 






45 


Val 


Lys 


Ala Ala 


He 


Lys 


Tyr 


Ala Leu Ser 


Val 


Gly Tyr Arg 


His 






50 






55 






60 


lie 


Asp 


Cys Ala 


Ser 


Val 


Tyr 


Gly Asn Glu 


Thr 


Glu He Gly 


Glu 








65 






70 






75 


Ala 


Leu 


Lys Glu 


Ser. 


Val 


Gly 


Ser Gly Lys 


Ala 


Val Pro Arg 


Glu 








80 






85 






90 


Glu 


Leu 


Phe Val 


Thr 


Ser 


Lys 


Leu Trp Asn 


Thr 


Lys His His 


Pro 








95 




100 






105 


Glu 


Asp 


Val Glu 


Pro 


Ala 


Leu 


Arg Lys Thr 












110 






115 









<210> 213 
<211> 161 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG:453570 . .1 . orf 3 : 2000PEB18 

* I 

<400> 213 



Cys 


Thr 


Leu 


Leu Arg 


Leu He Pro ' Ser 


Ala 


Ser 


His Phe Lys 


Arg 


1 






5 




10 






15 


Phe 


Asp 


Arg 


Val Arg 


Arg Phe Ala Pro 


Ala 


Ala 


Met Ala Thr 


Ser 




20 




25 






30 


Ser 


Gly 


Pro 


Lys Glu 


Ala Pro Ala Asn 


Asn 


Pro 


Gly Leu Gin 


Thr 






35 




40 






45 


Glu 


Val 


Asp 


Pro Ala 


Thr Lys Gly Tyr 


Phe 


Leu 


Gin Gin Thr 


Met 






50 


55 






60 


Leu 


Arg 


Val 


Lys Asp 


Pro Lys Val Ser 


Leu 


Asp 


Phe Tyr Ser 


Arg 








65 




70 






75 


Val 


Met 


Gly 


Met Ser 


Leu Leu Lys Arg 


Leu 


Asp 


Phe Glu Glu 


Met 






80 




85 




90 


Lys 


Phe 


Ser 


Leu Tyr 


Phe Leu Gly Tyr 


Glu 


Asp 


Val Thr Leu 


Ala 








95 




100 






105 


Pro 


Asp 


Asp 


His He 


Lys Arg Thr Glu 


Trp 


Thr 


Phe Arg Gin 


Lys 








110 




115 






120 


Ala 


Thr 


Leu 


Glu Leu 


Thr His Asn Trp 


Gly 


Thr 


Glu Asn Asp 


Pro 








125 




13 0 






135 


Glu 


Phe 


Lys 


Gly Tyr 


His Asn Gly Asn 


Ser 


Asp 


Pro Arg Gly 


Phe 








'140 




145 






150 


Gly 


His 


He 


Gly Val 


Thr Val Asp Asp 


Val 


His 












155 




160 









<210> 214 
<211> 332 
<212> PRT 
<213> Homo sapiens 
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<220> ' 
<221> misc_f eature 

<223> Incyte ID No: LG: 408751 .3 : orf 2 : 2000FEB18 
<400> 214 



Arg 


Asp 


Pro 


Gly Trp 


Gin 


lie 


Arg 


Asp 


Arg 


Ala 


Gly Leu 


Ala Trp 


1 








5 










10 










15 


Asn 


Met 


Leu 


Ala Asn 


Ser 


Ala 


Ser 


Val 


Arg 


lie 


Leu 


He 


Lys 


Gly 










20 










25 










30 


Gly 


Lys 


Val 


Val 


Asn 


Asp 


Asp 


Cys 


Thr 


His 


Glu 


Ala Asp 


Val 


Tyr 








35 










40 










45 


He 


Glu 


Asn 


Gly 


He 


He 


Gin 


Gin 


Val 


Gly 


Arg 


Glu 


Leu 


Met 


He 








50 










55 










60 


Pro 


Gly 


Gly 


Ala 


Lys 


Val 


He 


Asp 


Ala, 


Thr 


Gly 


Lys 


Leu 


Val 


He 








65 










70 










75 


Pro 


Gly 


Gly 


He 


Asp 


Thr 


Ser 


Thr 


His 


Phe 


His 


Gin 


Thr 


Phe 


Met 






80 










85 










90 


Asn 


Ala 


Thr 


Cys 


Val 
95 


Asp 


Asp 


Phe 


Tyr 


His 
100, 


Gly 


Thr 


Lys 


Ala 


Ala 
105 


Leu 


Val 


Gly 


Gly 


Thr 
110 


Thr 


Met 


He 


He 


Gly 
115 


His 


Val 


Leu 


Pro 


Asp 
120 


Lys 


Glu 


Thr 


Ser 


Leu 


Val 


Asp 


Ala 


Tyr 


Glu 


Lys 


Cys 


Arg 


Gly 


Leu 








125 










13 0 










135 


Ala 


Asp 


Pro 


Lys 


Val 


Cys 


Cys 


Asp 


Tyr 


Ala 


Leu 


His 


Val 


Gly 


lie 










140 








145 










150 


Thr 


Trp 


Trp 


Ala 


Pro 


Lys 


Val 


Lys 


Ala 


Glu 


Met 


Glu 


Thr 


Leu 


Val 






155 










160 










165 


Arg 


GlU 


Lys 


Gly Val 


Asn 


Ser 


Phe 


Gin 


Met 


Phe 


Met 


Thr 


Tyr 


Lys 






170 










175 










180 


Asp 


Leu 


Tyr 


Met 


Leu 


Arg 


Asp 


Ser 


Glu 


Leu 


Tyr 


Gin 


Val 


Leu 


His 






185 








190 










195 


Ala 


Cys 


Lys 


Asp 


He 


Gly 


Ala 


He 


Ala 


Arg 


Val 


His 


Ala 


Glu 


Asn 








200 










205 










210 


Gly 


Glu 


Leu 


Val 


Ala 


Glu 


Gly 


Ala 


Lys 


Glu 


Ala 


Leu 


Asp 


Leu Gly 










215 










220 










225 


He 


Thr 


Gly 


Pro 


Glu 


Gly 


He 


Glu 


He 


Ser 


Arg 


Pro 


Glu 


Glu 


Leu 








23 0 










235 










240 


Glu 


Ala 


Glu 


Ala 


Thr 


His 


Arg 


Val 


He 


Thr 


He 


Ala 


Asn 


Arg 


Thr 










245 








250 










255 


His 


Cys 


Pro 


He 


Tyr 


Leu 


Val 


Asn 


Val 


Ser 


Ser 


He 


Ser 


Ala Gly 








260 










265 










270 


Asp 


Val 


He 


Ala 


Ala 


Ala 


Lys 


Met 


Gin 


Gly Lys 


Val 


Val 


Leu 


Ala 








275 








280 










285 


Glu 


Thr 


Thr 


Thr 


Ala 
290 


His 


Ala 


Thr 


Leu 


Thr 
295 


Gly 


Leu 


His 


Tyr 


Tyr 
300 


His 


Gin 


Asp 


Trp 


Ser 


His 


Ala 


Ala 


Ala 


Tyr 


Val 


Thr 


Val 


Pro 


Pro 






305 










310 










315 


Leu 


Arg 


Leu 


Asp 


Thr 


Asn 


Thr 


Ser 


Thr 


Tyr 


Leu 


Met 


Ser 


Leu 


Leu 



320 325 330 

Ala Lys 



<210> 215 
<211> 274 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 090574 . 1 .orf 3 :2000FEB01 



Ser Arg 
15 

Trp Lys 
30 

He Glu 



<400> 215 

Ala Arg Asn Ser Ala Arg Val Pro Pro Arg Gly Thr Met 

15 10 
Leu Ser Trp Gly Tyr Arg Glu His Asn Gly Pro He His 

20 25 
Glu Phe Phe Pro He Ala Asp Gly Asp -Gin Gin Ser Pro 
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BNSDQCID: <WO 01 62927A2J_> 



WO 01/62927 g 


Ik 




lie 


Lys 


Thr 


Lys 


•3 b 

GJLU 


Val Lys Tyr Asp 










bU 




Ser 


lie 


Lys 


Tyr 


Asp 


fro ber ber Aia 








Ob 




Gly 


His 


Ser 


Phe 


Asn 


vai Asp Fne Asp 










q n 




Val 


Leu 


Arg 


Gly 


r>1\r 


irro ueu mr <-»iy 










ZD 




Val 


His 


Leu 


His 


Trp 


Gly Ser Ala Asp 










11U 




lie 


Val 


Asp 


Gly 


vai 


Ser Tyr Ala Ala 










lzb 




Trp 


Asn 


Ser 


Asp 


Lys 


iyr irro &er fne 










14U 




Pro 


Asp 


Gly 


Leu 


Ala 


vai Leu Giy vai 






lbb 


Pro 


Asn 


Ser 


Gin 


Leu 


Gin Lys He Tnr 










170 




Lys 


Glu 


Lys 


Gly 


Lys 


Gin Thr Arg Phe 










IOC 

185 




Ser 


Leu 


Leu 


Pro 


Pro 


Ser Trp Asp Tyr 










200 




Leu 


Thr 


val 


Pro 


Pro 


Leu Leu Glu Ser 










215 




Lys 


Gin 


Pro 


He 




Tift Qqv Qav y\ 

lie oeir ier uiii 










230 




Ser 


Leu 


Leu 


Cys 


Thr 


Ala Glu Gly Glu 








245 


His 


His 


Leu 


Ala 


Thr 


He Pro Cys Gly 










260 


Ser 


Gly 


His 


Cys 







PCT/US01/06059 

40 45 



Ser 


Ser 


Leu Arg 


Pro 


Leu 


55 








60 


Lys 


He 


lie Ser 


Asn 


Ser 


70 






75 


Asp 


Thr 


Glu Asn 


Lys 


Ser 


85 






90 


Ser 


Tyr 


Arg Leu 


Arg Gin 


100 








105 


Asp 


His 


Gly Ser 


Glu 


His 


115 








120 


Glu 


Leu 


His Val 


Val 


His 


130 








135 


Val 


Glu 


Ala Ala 


His 


Glu 


145 








150 


Phe 


Leu 


Gin He 


Gly Glu 


160 








165 


Asp 


Thr 


Leu Asp 


Ser 


He 


175 








180 


Thr 


Asn 


Phe Asp 


Leu 


Leu 


190 








195 


Trp 


Thr 


Tyr Pro 


Gly 


Ser 


205 








210 


Val 


Thr 


Trp He 


Val 


Leu 


220 






225 


Gin 


Leu 


Ala Lys 


Phe Arg 


235 








240 


Ala 


Ala 


Ala Phe 


Leu 


Val 


250 






255 


Ser 


Ser 


Ser Ala 


Thr 


Cys 


265 








270 



<210> 216 
<211> 182 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LI : 229932 . 2 . orf 1 :2000FEB01 
<400> 216 



Lys 


Leu 


Pro 


Leu 


Pro 


Pro 


Gly 


Ala Phe 


Ser 


Gly 


Leu Trp 


Lys Asn 


1 








5 








10 






15 


Gin 


Glu 


Ala 


Phe 


Lys 


His 


Leu 


Tyr Phe 


Glu 


Lys 


Phe Pro 


Gly Tyr 










20 








25 






30 


Tyr 


Asp 


Thr 


Met 


Asp 


Ala 


Gly 


Tyr Met 


Asp 


Glu 


Glu Gly 


Tyr Leu 










35 








40 






45 


Tyr 


Val 


Met 


Ser 


Arg 


Val 


Asp 


Asp Val 


He 


Asn 


Val Ala 


Gly His 










50 








55 






60 


Arg 


He 


Ser 


Ala 


Gly 


Ala 


He 


Glu Glu 


Ser 


He 


Leu Ser 


His Gly 










65 








70 






75 


Thr 


Val 


Ala 


Asp 


Cys 


Ala 


Val 


Val Gly 


Lys 


Glu 


Asp Pro 


Leu Lys 










80 








85 






90 


Gly 


His 


Val 


Pro 


Leu 


Ala 


Leu 


Cys Val 


Leu 


Arg 


Lys Asp 


He Asn 










95 






100 






105 


Ala 


Thr 


Glu 


Glu 


Gin 


Val 


Leu 


Glu Glu 


He 


Val 


Lys His 


Val Arg 










110 








115 




120 


Gin 


Asn 


He 


Gly 


Pro 


Val 


Ala 


Ala Phe 


Arg 


Asn 


Ala Val 


Phe Val 










125 








130 






135 


Lys 


Gin 


Leu 


Pro 


Lys 


Thr 


Arg 


Ser Gly 


Lys 


He 


Pro Arg 


Ser Ala 










140 








145 






150 


Leu 


Ser 


Ala 


He 


Val 


Asn 


Gly 


Lys Pro 


Tyr 


Lys 


He Thr 


Ser Thr 










155 






160 






165 


He 


Glu 


Asp 


Pro 


Ser 


He 


Phe 


Gly His 


Val 


Glu 


Glu Met 


Leu Lys 
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170 175 180 

Gin Ala 



<210> 217 
<211> '359 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 33 2176. 1 . orf 2 : 2000FEB01 
<400> 217 





Tip 

lie 


Lys 


Arg 


dor* 


Lys 


Gly Lys 


Glu 




Pro 


Phe 


Val 


Leu 


Thr 


-I 
X 








c: 
-j 








10 

J- V 










15 




dor 


Phe 


Phe 


/ilct 


Gly 


Ser Gin Lys 




Gly Ala Val 


Trp 


Thr 








20 








1 25 










30 


j.y 


Asp 


Asn 


Thr 


TV! a 


Glu Trp Ser Asn 


T ,D1 1 


Lys 


lie 


Ser 


He 


Pro 






O 3 


















45 




Leu 


Leu 


Thr 


Leu 


Ser 


He Thr Gly 


lie 


Ser 


Phe 


Cys 


Gly Ala 










3U 








cc 
33 










60 


Asp 


lie 


Gly Gly 


rile 


He 


Gly Asn 


Pro 


Pin 


Thr 


Glu 


Leu 


Leu 


Val 








03 








/ u 










75 


Arg 


Trp 


Tyr 


Gin 


Ala 


Gly Ala Tyr 


Gin 


Pro 


Phe Phe Arg Gly His 










on 
o U 








O -J 










90 


Ala 


Thr 


Met 


Asn 


Thr 


Lys 


Arg Arg 


Glu 


Pro 


Trp 


Leu 


Phe 


Gly Glu 










y 3 






100 










105 




HIS 


Thr 


Arg 


Leu 


He 


Arg Glu 


Ala 


lie 


Arg 


Glu Arg Tyr Gly 










11 A 

llv 








ii-j 










120 


LeU 


Leu 


Pro 


Tyr 


Trp 


Tyr 


Ser Leu 


Phe 


Tyr 
13 0 


His 


Ala 


His 


Val 


Ala 
135 


ser 


pi « 

'j-Lil 


Pro 


Val 


Mat* 

net 
1 AO 


Arg 


Pro Leu 


Trp 


val 

145 


Glu 


Phe 


Pro 


Asp 


Glu 
150 


J-IC5UL 




Thr 


Phe 


155 


Met 


Glu Asp 


Glu 


160 


Met 


Leu 


Gly 


Ser 


Ala 
165 


Leu 


J. JT^ 


Val 


His 


Xr ± w 


Val 


Thr Glu 


Pro 


Lys 


Ala 


Thr 


Thr 


Val 


Asp 








170 








175 










180 


Val 


XT HKm 


Leu 


Pro 


185 


Ser 


Asn Glu 


Val 


190 


Tyr 


Asp 


Tyr 


Lys 


Thr 
195 


Phe 


Ala 


His Gly 


Glu 


Gly Gly Cys 


Thr 


Val 


Lys 


He 


Pro 


Val 


Ala 










200 








205 










210 


Leu 


Asp 


Thr 


He 


Pro 


Val 


Phe Gin Arg 


Gly 


Gly 


Ser 


Val 


He 


Pro 








215 








220 










225 


lie 


Lys 


Thr 


Thr 


Val 


Gly Lys Ser 


Thr 


<3ly 


Trp Met 


Thr 


Glu 


Ser 








230 








235 










240 


Ser 


Tyr 


Gly Leu 


Arg 


Val 


Ala Leu 


Ser 


Thr 


Lys 


Gly 


ser 


Ser 


Val 








245 








250 










255 


Gly 


Glu 


Leu Tyr 


Leu 


Asp Asp Gly His 


Ser 


Phe 


Gin 


Tyr 


Leu 


His 










260 








265 










270 


Gin 


Lys 


Gin 


Phe 


Leu 


His 


Arg Lys 


Phe 


Ser 


Phe Cys 


Ser 


Ser 


Val 








275 








280 










285 


Leu 


He 


Asn 


Ser 


Phe 


Ala Asp Gin Arg 


Gly 


His 


Tyr 


Pro 


Ser 


Lys 










290 








295 










300 


Cys 


Val 


Val 


Glu 


Lys 


He 


Leu Val 


Leu 


Gly 


Phe 


Arg Lys 


Glu 


Pro 










305 








310 










315 


Ser 


Ser 


Val 


Thr 


Thr 
320 


His 


Ser Ser 


Asp 


Gly 
325 


Lys 


Asp 


Gin 


Pro 


Val 
330 


Ala 


Phe 


Thr 


Tyr 


Cys 


Ala 


Lys Thr 


Ser 


He 


Leu 


Ser 


Leu 


Glu 


Lys 








335 






340 










345 


Leu 


Ser 


Leu 


Asn 


He 


Ala Thr Asp 


Trp 


Glu 


Val 


Arg 


He 


He 





350 355 

<210> 218 
<211> 110 
<212> PRT 
<213> Homo sapiens 
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<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 403248 . 2 .orf 2 :2000FEB01 
<400> 218 



SCI 




rile 


lie Ser 


TiOii Prn p\/e Cot* Ala 




Tion TiV<* P*ro 


Cor 






i 


5 


10 




15 


±111. 


V71U 




TKnr Qor Cor Qor T.qii 
Xjri. O ci Del Oex , UcU 


irp 


Pl\; Dyn Ala 
oiy ITL,\J rlj.cl 


val 








' , 20 


25 






30 


Asp 


Gly 


Cys 


Asp • Cys 


Val Ala Glu Gly Leu 


Trp 


Leu Pro Gin 


Leu 








35 


40 






45 


His 


Val 


Gly 


Asp Trp 


Leu Val Phe Asp Asn 


Met 


Gly Ala Tyr 


Thr 








50 


55 




60 


Val 


Gly 


Met 


Gly Ser 


Pro Phe Trp Gly Thr 


Gin 


Ala Cys His 


He 








65 


70 






75 


Thr 


Tyr 


Ala 


Met Ser 


Arg Val Ala Trp ' Glu 


Ala 


Leu Arg Arg 


Gin 








80 


85 






90 


Leu 


Met 


Ala 


Ala Glu 


Gin Glu Asp Asp Val 


Glu 


Gly Val Cys 


Lys 








95 


100 






105 


Pro 


Leu 


Ser 


Cys Gly 
















110 











<210> 219 

<211> 549 

<212> PRT 1 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: LG: 220992 . 1 .orf 3 :2000MAY19 
<400> 219 



Arg 


Pro Val Thr Ser 


Phe Ser Pro Leu 


Pro 


Gly 


Ser Cys Gly 


Gly 


1 


5 




10 


15 


Arg 


Leu Gly Thr Arg 


Thr Met Leu Gly 


Arg 


Ser 


Leu Arg Glu 


Val 




20 




25 






30 


Ser 


Ala Ala .Leu Lys 


Gin Gly Gin He 


Thr 


Pro 


Thr Glu Leu 


Cys 




< ,'3.5 


40 






45 


Gin 


Lys Cys Leu Ser 


Leu He Lys , Lys 


Thr 


Lys 


Phe Leu Asn 


Ala 




50 




55 






60 


Tyr 


He Thr Val Ser 


Glu Glu Val Ala 


Leu 


Lys 


Gin Ala Glu 


Glu 




65 




70 




75 


Ser 


Glu Lys Arg Tyr 


Lys Asn Gly Gin 


Ser 


Leu 


Gly Asp Leu 


Asp 




80 




85 






90 


Gly 


He Pro He Ala 


Val Lys Asp Asn 


Phe 


Ser 


Thr Ser Gly 


He 




95 


100 






105 


Glu 


Thr Thr Cys Ala 


Ser Asn Met Leu 


Lys 


Gly 


Tyr He Pro 


Pro 




110 




115 






120 


Tyr 


Asn Ala Thr Val 


Val Gin Lys Leu 


Leu 


Asp 


Gin Gly Ala 


Leu 




125 




130 


135 


Leu 


Met Gly Lys Thr 


Asn Leu Asp Glu 


Phe 


Ala 


Met Gly Ser 


Gly 




140 




145 






150 


Ser 


Thr Asp Gly Val 


Phe Gly Pro Val 


Lys 


Asn 


Pro Trp Ser 


Tyr 




155 




160 






165 


Ser 


Lys Gin Tyr Arg 


Glu Lys Arg Lys 


Gin 


Asn 


Pro His Ser 


Glu 




170 




175 






180 


Asn 


Glu Asp Ser Asp 


Trp Leu He Thr 


Gly 


Gly 


Ser Ser Gly 


Gly 




185 




190 






195 


Ser 


Ala Ala Ala Val 


Ser Ala Phe Thr 


Cys 


Tyr 


Ala Ala Leu 


Gly 




200 




205 




210 


Ser 


Asp Thr Gly Gly 


Ser Thr Arg Asn 


Pro 


Ala 


Ala His Cys 


Gly 




215 




220 






225 


Leu 


Val Gly Phe Lys 


Pro Ser Tyr Gly 


Leu 


Val 


Ser Arg His 


Gly 




230 


235 






240 


Leu 


He Pro Leu Val 


Asn Ser Met Asp 


Val 


Pro 


Gly He Leu 


Thr 




245 


250 




255 


Arg 


Cys Val Asp Asp 


Ala Ala He Val 


Leu 


Gly 


Ala Leu Ala 


Gly 
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260 










265 








270 


Pro Asp 


Pro 


Arg 


Asp 


Ser 


Thr 


Thr Val 


His 


Glu 


Pro 


He Asn 


Lys 






275 










280 








285 


Pro Phe 


Met 


Leu 


Pro 


Ser 


Leu 


Ala Asp Val 


Ser 


Lys 


Leu Cys 


He 








290 










295 








30,0 


Gly lie 


Pro 


Lys Glu 


Tyr 


Leu 


Val 


Pro 


Glu 


Leu 


Ser 


Ser Glu 


Val 




3 05 








310 








315 


Gin Ser 


Leu 


Trp 


Ser 


Lys 


Ala 


Ala 


Asp 


Leu 


Phe 


Glu 


Ser Glu Gly 








320 










325 








330 


Ala Lys 


Val 


lie 


Glu 


Val 


Ser 


Leu 


Pro 


His 


Thr 


Ser 


Tyr Ser 


He 






335 










340 




• 




345 


Val Cys 


Tyr 


His 


Val 


Leu 


Cys 


Thr 


Ser 


Glu 


Val 


Ala 


Ser Asn 


Met 




350 








355 








360 


Ala Arg 


Phe 


Asp 


Glv 


Leu 


Gin 


Tvx 


Gly His 


Arg 


Cys 


Asp He 


Asp 








365 










•J I V 








375 


Val Ser 


Thr 


Glu 


Ala 


Met 


Tyr 


Ala 




J. XX J. 


Arg 


Arg 


Glu Gly 


Phe 








380 








•J O J 








390 


Asn Asp 


Val 


Val 


Arcr 


Gly 


Arg 


He 


Leu 


ser 


Gly 


Asn 


Phe Phe 


Leu 






395 










400' 








405 


Leu Lys 


Glu 


Asn 


Tvx 


GlU 


Asn 


Tvx 


xrxlts 


V d_L 


Lys 


Ala 


Gin Lys 


Val 






410 










A1 ^ 








420 


Arg Arg 


Leu 


lie 


Ala 


Asn 


Asp 


Phe 1 


Val 


/loll 


Ala 


Phe 


Asn Ser Gly 






425 










430 








AT^ 
4 J O 


Val Asp 


Val 


Leu 


Leu 


Thr 


Pro 


Thr 


Thr 


Leu 


Ser 


Glu 


nla Val 


XT X. O 






440 










AAR 








A^n 


t Vyx Leu 


GlU 


Phe 


He 


Lvs 


Glu 


Asp 


Asn 


Arg 


Thr 


Ara 


OCl AJ.CL 


Ql n 

UXIl 






455 










460 










Asp Asp 


lie 


Phe 


Thr 


Gin 


Ala 


Val 


Asn 


Met 


Ala 


Gly 


T.pii pro 


Ala 






470 










475 








480 


Val Ser 


lie 


Pro 


Val 


Ala 


Leu 


Ser 


Asn 


Gin 


Gly 


Leu 


Pro He 


Gly 








485 










490 








495 


Leu Gin 


Phe 


lie 


Gly 


Arg 


Ala 


Phe 


Cys 


Asp 


Gin 


Gin 


Leu Leu 


Thr 








500 










505 






Val He 


510 


Val Ala 


Lys 


Trp 


Phe 


Glu 


Lvs 


Gin 


Val 


Gin 


Phe 


Pro 


Gin 




515 










520 








525 


Leu Gin 


GlU 


Leu 


Met 


Asp 


Asp 


Cvs 


Ser 


Ala 


Val 


Leu 


Glu Asn 


Glu 








53 0 






535 








540 


Lys Leu 


Ala 


Ser 


Val 


Ser 


Leu 


Lys 


Gin 
















545 




















<210> 220 
























<211> 264 
























<212> PRT 
























<213> Homo sapiens 




















<220> 


























<221> misc_f eature 




















<223> Incyte ID 


No: 


LG : 1094571 . 1 . orf 1 : 2000MAY19 






<400> 220 
























Arg Thr 


Pro 


Ala 


Ala 


Arg 


Arg 


Pro 


Ala 


Leu 


Arg 


Phe 


Gly Pro 


Pro 


1 






5 










10 








15 


Pro Pro 


Pro 


Thr 


Pro 


Leu 


Thr 


Leu 


Gly 


Thr 


Tyr 


Phe 


Gly Cys 


Leu 








20 








25 








30 


Arg cys 


Pro 


Pro 


Ala 


Glu 


Thr 


Gin 


Leu 


Leu 


Arg 


Arg 


Pro Ala 


Val 






35 










40 








45 


Phe Val 


Gly 


Ser Ala 


Ala 


Ser 


Gly 


He 


Arg 


Ser 


Gly 


Leu Trp 


Ser 






50 










55 








60 


Ala Ser 


Ser 


Gly 


His 


Trp 


Cys 


Ala 


Pro 


Ala 


Ala 


Gly 


Arg Ala His 






65 






70 








75 


Ala Pro 


Val 


Pro 


Arg 


Leu 


Val 


Arg 


Gly 


Leu 


Gly 


Ala 


Ala Ser 


Thr 








80 






85 








90 


Ala Ala 


Pro 


Gin 


Asp 


Ala 


Gin 


Thr 


<31y 


Pro 


Gin 


Pro 


Met Pro 


Arg 








95 








100 








105 


Ala Asp 


Cys 


lie 


Met 


Arg 


His 


Leu 


Pro 


Tyr 


Phe 


Cys 


Arg Gly Gin 








110 










115 








120 


Val Val 


Arg 


Gly Phe 


Gly 


Arg 


Gly 


Ser 


Lys 


Gin 


Leu 


Gly He 


Pro 
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Ms 




1 in 

J.JU 








loo 


Thr Ala 


Asn Phe 


rJ. \j 


VjIU. V^lll 


val Val nop 


Asn 


iieu. 


irX Q 


Ala Asp 






1 AH 




IftD 








1 cn 


He Ser 


Thr Gly 


Tl o 
lie 




Vaiy lip nia 


o tri 


iV ctl 


uiy 


Ser Gly 






i r*; 




1 fin 










Asp Val 


His Lys 


Me»t- 

lufcr l_ 


vox vai 


del lit: Uljf 


ixp 


.0.0 11 


ir i w 


lyx iyx 


Lys Asn 




1 7fl 

X / w 




1 

X / 3 








i on 

lOU 


Thr Lys 






uiu J- iii. nio 


lie 


Met 


nio 


inx file 


• 




lOO 












173 


Lys Glu 


Asp Phe 


Tyr 


Gly Glu 


lie JJcU rioXl 


Veil 


Ala 


11c 


vai VaXy 


















9 1 n 


Tyr Leu 


Arg Pro 


VjIU 


Ta^cj 2V en 
LiyS ±\SL1 


Phe Asp Ser 


JLlcU. 


rai ii 

«1U 




Leu He 






91 R 




990 


Lys 






99^ 


Ser Ala 


He Gin 


Gly 


Asp He 


Glu Glu Ala 


Lys 


Arg 


Leu Glu 






230 




235 






240 


Leu Pro 


Glu His 


Leu 


Lys He 


Lys Glu Asp 


Asn 


Phe 


Phe 


Gin Val 






245 


250 








255 


Ser Lys 


Ser Lys 


He 


Met Asn 


Gly His 














260 













PCT/US01/06059 



<210> 221 
<211> 701 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID NO: LI : 350754 .4 .orf 3 :2000MAY01 
<400> 221 

Glu Glu Ala Glu Glu Gly Arg Asn Met Ala Ala Leu Gly Val Gin 
15 10 15 

Ser He Asn Trp Gin Lys Ala Phe Asn Arg Gin Ala His His Thr 
20 25 30 

Asp Lys Phe Ser Ser Gin Glu Leu He Leu Arg Arg Gly Gin Asn 
35 40 45 

Phe Gin Val Leu Met He Met Asn Lys Gly Leu Gly Ser Asn Glu 
50 55 60 

Arg Leu Glu Phe He Asp Thr Thr Gly Pro Tyr Pro Ser Glu Ser 
65 70 75 

Ala Met Thr Lys Ala Val Phe Pro Leu Ser Asn Gly Ser Ser Gly 
80 85 90 

Gly Trp Ser Ala Val Leu Gin Ala Ser Asn Gly Asn Thr Leu Thr 
95 100 105 

He Ser He Ser Ser Pro Ala Ser Ala Pro He Gly Arg Tyr Thr 

110 115 120 

Met Ala Leu Gin He Phe Ser Gin Gly Gly He Ser Ser Val Lys 

125 130 135 

Leu Gly Thr Phe He Leu Leu Phe Asn Pro Trp Leu' Asn Val Asp 

140 145 150 

Ser Val Phe Met Gly Asn His Ala Glu Arg Glu Glu Tyr Val Gin 

155 160 165 

Glu Asp Ala Gly He He Phe Val Gly Ser Thr Asn Arg He Gly 

170 175 180 

Met He Gly Trp Asn Phe Gly Gin Phe Glu Glu Asp He Leu Ser 

185 190 195 

He Cys Leu Ser He Leu Asp Arg Ser Leu Asn Phe Arg Arg Asp 

200 205 210 

Ala Ala Thr Asp Val Ala Ser Arg Asn Asp Pro Lys Tyr Val Gly 

215 220 ~ 225 

Arg Val Leu Ser Ala Met He Asn Ser Asn Asp Asp Asn Gly Val 

230 235 240 

Leu Ala Gly Asn Trp Ser Gly Thr Tyr Thr Gly Gly Arg Asp Pro 

245 250 255 

Arg Ser Trp Asp Gly ser Val Glu He Leu Lys Asn Trp Lys Lys 

260 265 270 

Ser Gly Phe Ser Pro Val Arg Tyr Gly Gin Cys Trp Val Phe Ala 
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275 ' 280 285 



Gly 


Thr 


Leu 


Asn 


Thr, 


Ala 


Leu 


Arg Ser 


Leu Gly 


He Pro Ser 


Arg 








290 










295 






300 


Val 


lie 


Thr 


Asn 


Phe 


Asn 


Ser 


Ala His 


Asp 


Thr 


Asp Arg Asn 


Leu 










305 










310 






315 


Ser 


Val 


Asp Val 


Tyr 


Tyr 


Asp 


Pro 


Met 


Gly Asn 


Pro Leu Asp 


Lys 










320 










325 






330 


Gly 


Ser 


Asp 


Ser - 


Val 


Trp 


Asn 


Phe 


His 


Val 


Trp 


Asn Glu Gly 


Trp 








335 










340 






345 


Phe 


Val 


Arg 


Ser 


Asp 


Leu 


Gly 


Pro 


Pro 


Tyr Gly 


Gly Trp Gin 


Val 










350 










355 






360 


Leu 


Asp 


Ala 


Thr 


Pro 


Gin 


Glu 


Arg 


Ser 


Glr.\ 


Gly 


Val Phe Gin 


Cys 








365 










370\ 






375 


Gly 


Pro 


Ala 


Ser 


Val 


lie 


Gly 


Val 


Arg 1 


Glu^Gly 


Asp Val Gin 


Leu 








380 










385 






390 


Asn 


Phe 


Asp Met 


Pro 


Phe 


lie 


Phe 


Ala 


GlU 


Val 


Asn Ala Asp 


Arg 










395 










400 






405 


lie 


Thr 


Trp 


Leu 


Tvr 


Asp 


Asn 


Thr 


Thr 


Gly Lys 


Gin Trp Lys 


Asn 








410 










415' 




420 


Ser 


Val 


Asn 


Ser 


His 


Thr 


He 


Gly Arg 


Tyr 


He 


Ser Thr Lys 


Ala 










425 










43 0 






435 


Val 


Glv 


Ser 


Asn 


Ala 


Ara 


Met 


Asp Val 


Thr 


Asp 


Lys Tyr Lys 


Tyr 








440 










445 






450 


Pro 


GlU 


Gly Ser 


Asp 


Gin 


Glu 


Arg 


Gin 


Val 


Phe 


Gin Lys Ala 


Leu 










455 










460 






465 


Glv 


Lys 


Leu 


Lys 


Pro 


Asn 


Thr 


Pro 


Phe 


Ala 


Ala 


Thr Ser Ser 


Met 




470 










475 






480 


Glv 


Leu 


GlU 


Thr 


Glu 


Glu 


Gin 


Glu 


Pro 


Ser 


He 


He Gly Lys 


Leu 








485 










490 






495 


Lys 


Val 


Ala Gly 


Met 


Leu 


Ala 


Val 


Gly 


Lys 


Glu 


Val Asn Leu 


Val 








500 










505 






510 


Leu 


Leu 


Leu 


Lys 


Asn 


Leu 


Ser 


Arg Asp 


Thr 


Lys 


Thr Val Thr 


Val 










515 










520 






525 


Asn 


Met 


Thr 


Ala 




Thr 


He 


He 


Tyr 


Asn Gly 


Thr Leu Val 


His 










530 










535 






540 


Glu 


Val 


Trp 


Lys 


Asp 


Ser 


Ala 


Thr 


Met 


Ser 


Leu 


Asp Pro Glu 


Glu 








545 










550 






555 


Glu 


Ala 


Glu 


His 


Pro 


lie 


Lys 


He 


Ser 


Tyr 


Ala 


Gin Tyr Glu 


Arg 










560 








565 




570 


Tyr 


Leu 


Lys 


Ser 


Asp 


Asn 


Met 


He Arg 


He 


Thr 


Ala Val Cys 


Lys 








575 










580 






585 


Val 


Pro 


Asp 


Glu 


Ser 


Glu 


Val 


Val 


Val 


Glu Arg 


Asp He He 


Leu 








590 










595 






600 


Asd 


Asn 


Pro 


Thr 


Leu 


Thr 


Leu 


Glu 


Val 


Leu 


Asn 


Glu Ala Arg 


Val 








605 










610 






615 


Arcr 


Lys 


Pro 


Val 


Asn 


Val 


Gin 


Met 


Leu 


Phe 


Ser 


Asn Pro Leu 


Asp 






620 










625 






630 


Glu 


Pro 


Val 


Arg 


Asp 
635 


Cys 


Val 


Leu 


Met 


Val 
640 


Glu 


Gly Ser Gly 


Leu 
645 


Leu 


Leu 


Gly Asn 


Leu 


Lys 


He 


Asp 


Val 


Pro 


Thr 


Leu Gly Pro 


Lys 










650 










655 






660 


Glu 


Arg 


Ser 


Arg 


Val 


Arg 


Phe 


Asp 


He 


Leu 


Pro 


Ser Arg Ser 


Gly 






665 










670 






675 


Thr 


Lys 


Gin 


Leu 


Leu 


Ala 


Asp 


Phe 


Ser 


Cys 


Asn 


Lys Phe Pro 


Ala 








680 










685 






690 


lie 


Lys 


Ala 


Met 


Leu 


Ser 


He 


Asp 


Val 


Ala 


Glu 







695 700 

<210> 222 
<211> 150 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: LI : 255828 . 29 . orf 2 : 2000MAY01 
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<400> 222 



Cys 


Thr 


He Gly 


Pro 


Ala Ser 


Arcr Ser Val Glu Thr Leu Tjvs 

ru. y mcj. vux w±u i.iii> j_i jr o 


Glu 


1 






5 




10 


15 


Met 


He 


Lvs Ser 


Glv 


Met Asn 


Val Ala Arcr Leu Asn Phe Ser 


His 








20 




25 


30 


Glv 


Thr 


His Glu 


Tvr 


His Ala 


Glu Thr He Lvs Asn Val Arcr 


Thr 








35 




40 


45 


Ala 


Thr 


Glu Ser 


Phe 


Ala Ser 


Ast> Pro Tie Leu Tvr Arcr Pro 


Val 








50 




55 


60 


Ala 


Val 


Ala Leu 


Asp 


Thr Lys 


Glv Pro Glu Tie Arcr Thr Glv 










65 




70 


75 


He 


Lvs 


Gly Ser 


Glv 


Thr Ala 


Glu Val Glu Leu Lvs Lvs Ala 


Ala 








80 




85 


90 


Thr 


Leu 


Lys He 


Thr 


Leu Asp 


Asn Ala Tyr Met Glu Lys Cys 


Asp 








95 




100 


105 


GlU 


Asn 


lie Leu 


Trp 


Leu Asp 


Tyr Lys Asn He Cys Lys Val 


Val 








110 




115 


120 


GlU 


Val 


Ser Arg 


Leu 


His His 


Ala Val Trp Arg Asn Ser Gin 


Arg 








125 




130 


135 


Gly 


Leu 


Ser Ser 


Gly 


Gly Cys 


Ala His Ala Ala Pro Asp Ser 


Ser 








140 




145 


150 



<210> 223 
1 <211> 234 
<212> PRT 
<213> Homo sapiens 

<220> 

i <221> misc_feature 

<:223> Incyte ID No: LI : 1190263 . 1 . orf 2 : 2000MAY01 

<400> 223 



Ala 


Ala 


Gly Ser Leu 


Phe Pro Gly Leu 


Leu 


He' Phe Ser Met 


He 


1 




5 


10 




15 


Leu 


Phe 


He Phe Leu 


Leu Gly Tyr Ala 


Trp 


Phe Ser Ser His 


Thr 






20 




25 




30 


Ser 


Pro 


Leu Tyr Trp 


Asp Cys Leu Leu 


Met 


Arg Gly His Glu 


He 






35 




40 


45 


Thr 


Glu 


Gin Pro Met 


Lys Ala Glu Arg 


Ala 


Gly Ser He Met 


Val 






50 




55 




60 


Lys 


Glu 


Ala He Ser 


Phe Leu Glu Arg 


His 


Ser Lys Glu Thr 


Phe 






65 




70 


75 


Leu 


Leu 


Phe Phe Ser 


Phe Leu His Val 


His 


Thr Pro Leu Pro 


Thr 






80 




85 




90 


Thr 


Asp 


Asp Phe Thr 


Gly Thr Ser Lys 


His 


Gly Leu Tyr Gly 


Asp 






95 




100 




105 


Asn 


Val 


Asp Glu Met 


Asp Ser Met Val 


Gly 


Lys He Leu Asp 


Ala 






110 




115 




120 


He 


Asp 


Asp Phe Gly 


Leu Arg Asn Asn 


Thr 


Leu Val Tyr Phe 


Thr 






125 




130 




135 


Ser 


Asp 


His Gly Gly 


His Leu Glu Ala 


Arg 


Arg Gly His Ala 


Gin 






140 




145 




150 


Leu 


Gly 


Gly Trp Asn 


Gly He Tyr Lys 


Gly 


Gly Lys Gly Met 


Gly 






155 




160 




165 


Gly 


Trp 


Glu Gly Gly 


He Arg Val Pro 


Gly 


He Val Arg Trp 


Pro 






170 




175 




180 


Gly 


Lys 


Val Pro Ala 


Gly Arg Leu He 


Lys 


Glu Pro Thr Ser 


Leu 






185 




190 




195 


Met 


Asp 


He Leu Pro 


Thr Val Ala Ser 


Val 


Ser Gly Gly Ser 


Leu 






200 




205 


210 


Pro 


Gin 


Asp Arg Val 


He Asp Gly Arg 


Asp 


Leu Met Pro Leu 


Leu 






215 




220 




225 


Ala 


Gly 


Gin Arg Gin 


Ala Leu Gly Ala 












230 











<210> 224 
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<211> 86 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No : LG: 270916 . 2 . orf 2 : 2000FEB18 

i 

<400> 224 



Lys 


Phe 


Ser 


Ser 


Asn 


Gly Leu 


Trp Pro 


Ser 


Thr 


Trp 


Leu 


Leu 


Thr 


1 








5 








10 










15 


Thr 


Arg 


Thr 


Leu 


Pro 


Met 


lie 


Ser Arg 


Cys 


Ser 


Pro 


Met 


His 


Leu 










20 








25 










30 


Leu 


Thr 


lie 


Ser 


Ser 


Ala 


Phe 


Cys Leu 


Leu 


Cys 


Pro 


Pro 


Pro 


Arg 










35 






40 










45 


Met 


Pro 


Phe 


Gin 


Lys 


Cys 


Leu 


Leu Leu 


Ser 


Arg 


Tyr 


Arg 


Ser 


Arg 










50 








55 










60 


Gly Val 


Leu 


Val 


Ala 


Val 


lie 


Trp Gly 


Thr 


Thr 


GlU 


Ala 


Ser 


Gly 










65 






70 










75 


lie 


Ser 


Gly 


Leu 


lie 


Thr 


Ala 


Leu Trp 


Glu 


Phe 











80 85 

<210> 225 
<211> 173 
<212> PRT 
<213> Homo sapiens 

<220> 

,<221> misc_ feature 

<223> Incyte ID No: LG: 999414 . 3 . orf 2 : 2000FEB18 
<400> 225 



lie 


Leu 


Thr 


Ser 


Val 


Ser Ser 


Ser 


Phe 


Trp 


Cys 


Pro 


Phe 


Phe 


Leu 


1 








5 








10 










15 


Ser 


Leu 


Leu Asp 


Ser 


Gin Leu 


His 


Ser 


Trp 


He 


Val 


Leu 


Gin 


Leu 










20 








25 










30 


Thr 


He 


He 


Lys 


Asn 


Val Glu 


He 


Ser 


Asn 


Leu 


Val 


Cys 


Asp 


Pro 










35 








40 










45 


Ser 


Gin 


Leu 


Leu 


Asn 


Leu Ala 


Cys 


Ser 


Asp 


Ser 


Val 


He 


Asp 


Ser 










50 








55 










60 


He 


Phe 


He 


Tyr 


Leu 


Asp Ser 


Thr 


lie 


Phe 


Gly 


Phe 


Leu 


Pro 


He 








65 








70 










75 


Ser 


Gly 


He 


Leu 


Leu 


Ser Tyr 


Tyr 


Lys 


He 


Val 


Pro 


Ser 


He 


Leu 










80 








85 










90 


Arg 


He 


Ser 


Ser 


Ser 


Asp Gly Lys 


Tyr 


Lys 


Ala 


Phe 


Ser 


Thr 


Cys 










95 








100 










105 


Arg 


Ser 


His 


Leu 


Ala 


Val Val 


Cys 


Leu 


Phe 


Tyr 


Gly Thr 


Gly 


He 










110 








115 










120 


Gly 


Val 


Tyr 


Leu 


Thr 


Ser Ala 


Val 


Ala 


Pro 


Ala 


Pro 


Arg 


Ser 


Gly 






125 








130 










135 


Val 


Val 


Val 


Ser 


Val 


Met Tyr 


Thr 


Val 


Val 


Thr 


Pro 


Met 


Leu 


Asn 










140 






145 










150 


Pro 


Phe 


He 


Tyr 


Cys 


Leu Arg 


Lys 


Gin 


Gly 


His 


Ser 


Lys 


Arg 


Leu 










155 








160 










165 


Trp 


Arg 


Cys 


Ala 


Ala 


Glu Gin 


Ser 

















170 

<210> 226 
<211> 68 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 429446 . 1 . orf 2 : 2000FEB18 
<400> 226 
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0162927A2_I_> 
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His Leu Val AlaThr Val Arg Gly Phe Ser Lys Val Phe Val Ser 
1 5 , 10 15 

Ser Arg lie Lys Thr Val Lys Leu Gin He Val Leu Gin Met Glu 
20 25 30 

Pro Gin Met Gin Ser Met Thr Lys He Tyr His Arg Pro Leu Asp 
35 '40 45 

Arg Pro Ala Ser Pro Cys Ser Asp Val Asp Asp He Glu Gly Ala 
, • ■ 50 55 60 

Pro Pro Lys Glu He Ser Thr Ala 
' 65 

<210> 227 . 
<211> 70 
<212> PRT 

<213> Homo sapiens 1 
<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 057229 . 1 .orf 1 :2000FEB01 , 
<400> 227 

Gin Pro Ser Leu Pro Glu Phe Ser His Phe Gin Lys Thr Val Leu 
1 5 1 10 15 

Leu Glu Ser Lys lie Ala Arg Gin Phe He Leu Phe Tyr Phe He 

20 25 30 

Leu His He Phe Leu Arg Gin Ser Leu Ala Leu Phe Pro Arg Leu 

35 40 .45 ' 1 

Glu Cys Gly Gly Ala Val Leu Ala His Cys Asn Leu Cys Leu Leu 

50 55 60 

Gly Ser Ser Asp Ser Pro Ala Ser Ala Ser 

65 70 



<210> 228 
<211> 117 
<212> PRT 

<213> Homo sapiens 

<220> 1 ' ■ 

<221> misc_feature 

<223> Incyte ID No: LI:351965.1 .orf 2 :2000FEB01 
<400> 228 



Pro 


Thr 


Thr 


Ser Asn 


Arg Ala He Thr 


Leu 


Thr 


Ala Arg Pro 


Lys 


1 






5 




10 






15 


He 


Pro 


Phe 


Leu Arg 


He Arg Glu Ala 


Lys 


Asn 


Pro Arg Ser 


Glu 








20 




25 






30 


Asn 


Met 


Arg 


Leu Ala 


Thr He Leu Glu 


Val 


Ala 


Cys Arg His 


Phe 








35 




40 






45 


Gly 


Ser 


Gly 


Leu Pro 


Pro Ser Trp Glu 


Leu 


Trp 


Glu Gin Gly 


Pro 








50 




55 






60 


Pro 


Gly 


Asn 


Ser Ser 


Arg Tyr He Glu 


Phe 


Leu 


Asn Lys His 


Thr 








65 




70 






75 


Tyr 


He 


Lys 


Gly Thr 


Leu Arg Val Tyr 


Thr 


Lys 


Lys Phe Cys 


Met 








80 




85 






90 


Leu 


Val 


He 


Lys Ser 


Phe Glu Ser Lys 


Ser 


Cys 


Val Cys Val 


Tyr 








95 




100 






105 


Asp 


Phe 


Asp 


Ser Lys 


Ser Ser Val Asn 


Val 


Thr 


Val 








,110 




115 









<210> 229 
<211> 294 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> mis cofeature 

<223> Incyte ID No: LG: 068682 . 1 .orf 2 :2000FEB18 
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BNSDOCID: <WO 0162927A2J_> 



WO 01/62927 



PCT/US01/06059 



<400> 229 
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Gly His 


Met 
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Val 


Arg Gly 


Cys 
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5 




10 
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Arg 
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Ala 


Pro 


Trp 


Leu 


Arg Pro 


Glu 


He 


Gly Val Arg 


Pro 
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25 
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Pro 
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Arg 
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Ala 


Ala 


Ser Cys 


Pro 


Pro 


Cys 


Ala 


Leu 


Gly 








' 35 
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45 
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Thr 


Met 


Ser Gly 


Asp 


Lys 


Leu Leu 


Ser 


Glu 


Leu Gly Tyr 


Lys 
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55 










60 


Leu 


Gly 


Arg 


Thr 


He 


Gly 


Glu 


Gly Ser 


Tyr 


Ser 


Lys 


Val 


Lys 


Val 








65 








70 










75 


Ala 


Thr 


Ser 


Lys 


Lys 


Tyr 


Lys 


Gly Thr 


Val 


Ala 


lie Lys Val 


Val 








80 








85 










90 


Asp 


Arg 


Arg 


Arg 


Ala 


Pro 


Pro 


Asp Phe, 


Val 


Asn 


Lys 


Phe 


Leu 


Pro 








95 








100 










105 


Arg 


Glu 


Leu 


Ser 


He 
110 


Leu 


Arg 


Gly Val 


Arg 
115 


His 


Pro 


His 


lie 


Val 
120 


His 


Val 


Phe 


Glu 


Phe 
125 


He 


Glu 


Val Cys 


Asn 
130, 


Gly 


Lvs 


Leu 


Tyr 


He 
135 


Val 


Met 


Glu 


Ala 


Ala 
140 


Ala 


Thr 


Asp Leu 


Leu 
145 


Gin 


Ala 


Val 


Gin 


Arg 
150 


Asn 


Gly 


Arg 


He 


Pro 


Gly 


Val 


Gin, Ala 


Arg 


Asp 


Leu 


Phe 


Ala 


Gin 
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160 










165 


He 


Ala 


Gly 


Ala 


Val 


Arg 


Tyr 


Leu His 


Asp 


His 


His 


Leu Val 


His 
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175 










180 


Arg 


Asp 


Leu 


Lys 


Cys 


Glu 


Asn 


Val Leu 


Leu 


Ser 


Pro 


Asp 


Glu 


Arg 
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190 










195 


Arg 


Val 


Lys 


Leu 


Thr 


Asp 


Phe 


Gly Phe 


Gly 


Arg 


Gin 


Ala 


His 


Gly 






200 








205 










210 


Tyr 


Pro 


Asp 


Leu 


Ser 


Thr 


Thr 


Tyr Cys 


Gly 


ser 


Ala 


Ala 


Tyr 


Ala 






215 






220 










225 


Ser 


Pro 


Glu 


Val 


Leu 


Leu 


Gly 


He Pro 


Tyr 


Asp 


Pro 


Lys 


Lys 


Tyr 










230 






235 










240 


Asp 


Val 


Trp 


Ser 


Met 


Gly 


Val 


Val Leu 


Tyr 


Val 


Met 


Val 


Thr 


Gly 






245 






250 










255 


Cys 


Met 


Pro 


Phe 


Asp 


Asp 


Ser 


Asp He 


Ala 


Gly 


Leu 


Pro 


Arg 


Arg 








260 








265 










270 


Gin 


Lys 


Arg 


Gly Val 


Leu 


Tyr 


Pro ' Glu 


Gly 


Leu 


Glu 


Leu 


Ser 


Glu 






275 








280 










285 


Arg 


Cys 


Lys 


Ala 


Leu 
290 


He 


Ala 


Glu Leu 















<210> 230 
<211> 326 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 242665 . 1 . orfl : 2000FEB18 
<400> 230 



His 


His 


Leu 


He 


Ser 


Leu 


Tyr 


Phe 


Thr 


Asp 


Phe 


Pro 


He 


Ser 


Phe 


1 








5 








10 










15 


Phe 


Met 


Phe 


Tyr 


Ala 


Asn 


Phe 


Ser 


Arg 


Arg 


Thr 


Gly 


Pro 


Ala 


Pro 
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Pro 


Leu 


Arg 


Thr 


Thr 


Pro 


Arg 


Ala 


Trp 


Leu 


Arg 


Arg 


Glu 
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Gly 
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Ala 


Ser 


Thr 


Met 


Ser 


Ala 


Pro 


Gly 


Ser 


Pro 


Asp 


Gin Ala Tyr 


Asp 










50 








55 










60 


Phe 


Leu 


Leu 


Lys 


Phe 


Leu 


Leu 


Val 


Gly 


Asp 


Arg 


Asp 


Val 


Gly 


Lys 








65 










70 










75 


Ser 


Glu 


He 


Leu 


Glu 


Ser 


Leu 


Gin 


Asp 


Gly 


Ala 


Ala 


Glu 


Ser 


Pro 
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85 










90 


Tyr 


Ser 


His 


Leu 


Gly 


Gly 


He 


Asp 


Tyr 


Lys 


Thr 


Thr 


Thr 


He 


Leu 








95 










100 










105 


Leu 


Asp 


Gly 


Gin 


Arg 


Val 


Lys 


Leu 


Lys 


Leu 


Trp 


Asp 


Thr 


Ser 


Gly 
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120 
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p 1 1-1 


Ala 

Aia 


TVjtt" A1 a pi ii 
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i on 
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J-iGu (jiy 


vai inr 


Fne 


Phe Glu Val Ser 
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Leu 


v_ys nsn rue 


Asn 








one 
Z uj 






oi n 

Z J.U 


Tl ~ Tl — . 

lie lie 


Glu Ser 


File 


inr giu Leu Aia 


Arg 


Tl A 
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vai lieu ueu 


Arg 
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zz u 
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ZZ3 


riis Arg 


wee Asn 


Trp 


beu Giy Arg Fro 


ber 


Lys 


vai lieu ber 


Leu 






zoU 




97 5 
Z J3 






9/t n 


Gin Asp 


Leu Cys 


Cys 


Arg inr lie vai 


Oat> 

ber 


Cys 


inr Fro vai 


HIS 






245 




250 






255 


Leu Val 


Asp Lys 


Leu 


Pro Leu Pro Ser 


Thr 


Leu 


Arg Ser His 


Leu 




260 


Ala Lys Gly Leu 


265 




270 


Lys Ser 


Phe Ser 


Met 


Asn 


Ala 


Arg Met Met 


Arg 






275 




280 






285 


Gly Leu 


Ser Tyr 


Ser 


Leu Thr Thr Ser 


Ser 


Thr 


His Lys Arg 


Ser 




290 


295 






300 


Ser Leu 


Cys Lys 


Val 


Lys lie Val Cys 


Pro 


Pro 


Gin Ser Pro 


Pro 




305 


310 






315 


Lys Asn 


Cys Thr 


Arg 


Asn Ser Cys Lys 


He 


Ser 







<210> 231 
<211> 182 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 241743 . 1 . orfl : 200 0FEB18 
<400> 231 



Lys 


Ser Gly Thr Pro 


Arg Arg Ala Leu 


Leu 


Leu Phe Leu Val 


Phe 


1 


5 




10 




15 


Lys 


He Arg Gly Ser 


Pro Val Ser His 


Leu 


Met Pro Arg Leu 


Lys 




20 




25 




30 


Glu 


Ser Arg Ser His 


Glu Ser Leu Leu 


Ser 


Pro Ser Ser Ala 


Val 




35 




40 




45 


Glu 


Ala Leu Asp Leu 


Ser Met Glu Glu 


Glu 


Val Val He Lys 


Pro 




50 




55 




60 


Val 


His Ser Ser He 


Leu Gly Gin Asp 


Tyr 


Cys Phe Glu Val 


Thr 




65 


70 




75 


Thr 


Ser Ser Gly Ser 


Lys Cys Phe Ser 


Cys 


Arg Ser Ala Ala 


Glu 




80 




85 




90 


Arg 


Asp Lys Trp Met 


Glu Asn Leu Arg 


Arg 


Ala Val His Pro 


Asn 




95 




100 




105 


Lys 


Asp Asn Ser Arg 


Arg Val Glu His 


He 


Leu Lys Leu Trp 


Val 




110 




115 




120 


He 


Glu Ala Lys Asp 


Leu Pro Ala Lys 


Lys 


Lys Tyr Leu Cys 


Glu 




125 




130 




135 


Leu 


Cys Leu Asp Asp 


Val Leu Tyr Ala 


Arg 


Thr Thr Gly Lys 


Leu 




140 




145 




150 


Lys 


Thr Asp Asn Val 


Phe Trp Gly Glu 


His 


Phe Glu Phe His 


Asn 




155 


160 




165 


Leu 


Pro Pro Leu Arg 


Thr Val Thr Val 


His 


Leu Tyr Arg Glu 


Thr 




170 




175 


180 


Asp 


Lys 











320 



325 
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<210> 232 
<211> 358 
<212> PRT 

<213> Homo sapiens 1 
<220> 

<221> misc_f eature 

<223> Incyte ID No: LI : 034212 . 1 .orf 1 : 2000FEB01 
<220> 

<221> unsure 
<222> 25 

<223> unknown or other 
<400> 232 
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Ser 


Ser Leu 


Thr 


Gin 


Leu Arg Arg 


Leu 


Glu 


Glu 


Leu 


Asp 


Leu 
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Pro 


Xaa 


Ser 


lie Gly Ala 


Leu 
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30 
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Trp 


Leu Asp 


Glv 


Asn 


Gin 


Leu 


Ser 


Glu 
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40 










45 


UcU 


riv 


Gin Gin 


JL J- t= 


Gl v 


Ash Leu Lys 


Asn 


Leu 


Leu 


Cys 


Leu Asp 
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60 


Val 


Ser 


Glu Asn 


Arg 


Leu 


Glu Arg Leu 


Pro 


Glu 


Glu 


He 


Ser Gly 


















70 










75 


Leu 


J. 1.11. 


Ser Leu 


mi. 




Leu 


Val 


lie 


Ser 


Gin 


Asn 


Leu 


Leu 


Glu 








AO 








85 










90 


Thr 


lie 


Pro Asp 


Glv 


lie 


Gly Lys 


Leu 


Lys 


Lys 


Leu 


Ser 


lie 


Leu 






95 










100 










105 


Lys 


Val 


Asn Gin 


Asn 


Arg 


Leu 


Thr 


Gin 


Leu 


Pro 


Glu 


Ala 


Val 


Glv 




110 










115 










120 


Glu 


Cvs 


Glu Ser 


Leu 


Thr 


Glu 


Leu 


Val 


Leu 


Thr 


Glu 


Asn 


Gin 


Leu 






125 










13 0 










135 


Leu 


Thr 


Leu Pro 


Lys 


Ser 


He Gly Lys 


Leu 


Lys 


Lys 


Leu 


Ser 


Asn 








140 










145 










150 


Leu 


Asn 


Ala Asp 


Arcr 


Asn 


Lys 


Leu 


Val 


Ser 


Leu 


Pro 


Lys 


Glu 


lie 






155 










160 










165 


Gly 


Gly 


Cys Cys 


Ser 


Leu 


Thr 


Val 


Phe 


Cys 


Val 


Arg Asp 


Asn 


Arg 






170 










175 










180 


Leu 


Thr 


Arg lie 


Pro 


Ala 


Glu 


Val 


Ser 


Gin 


Ala 


Thr 


Glu 


Leu 


His 






185 










190 










195 


Val 


Leu 


Asp Val 


Ala 


Gly 


Asn Arg 


Leu 


Leu 


His 


Leu 


Pro 


Leu 


Ser 






200 








205 










210 


Leu 


Thr 


Ala Leu 


Lys 


Leu 


Lys 


Ala 


Leu 


Trp 


Leu 


Ser 


Asp 


Asn 


Gin 








215 








220 










225 


Ser 


Gin 


Pro Leu 


Leu 
230 


Thr 


Phe 


Gin 


Thr 


Asp 
235 


Thr 


Asp 


Tyr 


Thr 


Thr 
240 


Gly 


Glu 


Lys lie 


Leu 


Thr 


Cys 


Val 


Leu 


Leu 


Pro 


Gin 


Leu 


Pro 


Ser 




245 








250 










255 


Glu 


Pro 


Thr Cys 


Gin 


Glu 


Asn 


Leu 


Pro 


Arg 


Cys 


Gly Ala 


Leu 


Glu 






260 










265 










270 


Asn 


Leu 


Val Asn 


Asp 


Val 


Ser 


Asp 


Glu 


Ala 


Trp 


Asn 


Glu Arg 


Ala 








275 










280 










285 


Val 


Asn 


Arg Val 


Ser 


Ala 


He 


Arg 


Phe 


Val 


Glu 


Asp Glu 


Lys 


Asp 






290 










295 










300 


Glu 


Glu 


Asp Asn 


Glu 


Thr 


Arg 


Thr 


Leu 


Leu 


Arg Arg 


Ala 


Thr 


Pro 






305 










310 










315 


His 


Pro 


Gly Glu 


Leu 


Lys 


His 


Met 


Lys 


Lys 


Thr 


Val 


Glu 


Asn 


Leu 
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330 


Arg 


Asn 


Asp Met 
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Ala 


Ala 


Lys 


Gly Leu 


Asp 


Ser 


Asn 


Lys 
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Glu 


Val 
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He 
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Val 







350 355 

<210> 233 
<211> 194 
<212> PRT 
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BNSDOCID: <WO 0162927A2 I > 



WO 01/62927 PCT/US01/06059 
<213> Homo sapiens 



<220> 

<221> misc_f eature • 

<223> Incyte ID No: LG: 344886 . 1 .orfl : 2000MAY19 
<400> 233 , 

Glu Lys Met Gly Lys Gly Cys Lys Val. Val Val Cys Gly Leu Leu 

1 ,5 10 15 

Ser Val Gly Lys Thr Ala lie Leu Glu Gin Leu Leu Tyr Gly Asn 

20 25 30 

His Thr lie Gly Met. Glu Asp Cys Glu Thr Met Glu Asp Val Tyr 

35 40 ; . ' 45 

Met Ala Ser Val Glu Thr Asp Arg Gly Val Lys Glu Gin Leu His 

50 55 60 

Leu Tyr Asp Thr Arg Gly Leu Gin Glu Gly Val Glu Leu Pro Lys 

65 70 75 

His Tyr Phe Ser Phe Ala Asp Gly Phe Val Leu Val Tyr Ser Val 

80 85 90 

Asn Asn Leu Glu Ser Phe Gin Arg Val Glu Leu Leu Lys Lys Glu 

95 100' 105 

lie Asp Lys Phe Lys Asp Lys Lys Glu Val Ala lie Val Val Leu 

110 115 120 

Gly Asn Lys lie Asp Leu Ser Glu' Gin Arg Gin Val Asp Ala Glu 

1^5 . 130 135 

Val Ala Gin Gin Trp Ala Lys Ser Glu Lys Val Arg Leu Trp Glu 

140 145 150 

Val Thr Val Thr Asp Arg Lys Thr Leu lie Glu Pro Phe Thr Leu 

155 160 165 

Leu Ala Ser Lys Leu Ser Gin Pro Gin Ser Lys Ser Ser Phe Pro 

170 175 180 
Leu Pro Gly Arg Lys Asn Lys Gly Asn Ser Asn Ser Glu Asn 
185 190 

<210> 234 
<211> 222 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LG: 228930 . 1 .orf 2 :2000MAY19 
<400> 234 

Ala Gin Met Ala Gly Ala Gin Pro Gly Val His Ala Leu Gin Leu 
15 10 15 

Lys Pro Val Cys Val Ser Asp Ser Leu Lys Lys Gly Thr Lys Phe 
20 25 30 

Val Lys Trp Asp Asp Asp Ser Thr lie Val Thr Pro lie lie Leu 
35 40 45 

Arg Thr Asp Pro Gin Gly Phe Phe Phe Tyr Trp Thr Asp Gin Asn 
50 55 60 

Lys Glu Thr Glu Leu Leu Asp Leu Ser Leu Val Lys Asp Ala Arg 
65 70 75 

Cys Gly Arg His Ala Lys Ala Pro Lys Asp Pro Lys Leu Arg Glu 
80 85 90 

Leu Leu Asp Val Gly Asn lie Gly Arg Leu Glu Gin Arg Met lie 
95 100 105 

Thr Val Val Tyr Gly Pro Asp Leu Val Asn lie Ser His Leu Asn 
110 115 120 

Leu Val Ala Phe Gin Glu Glu Val Ala Lys Glu Trp Thr Asn Glu 
125 130 135 

Val Phe Ser Leu Ala Thr Asn Leu Leu Ala Gin Asn Met Ser Arg 
140 145 150 

Asp Ala Phe Leu Glu Lys Ala Tyr Thr Lys Leu Lys Leu Gin Val 
155 160 165 

Thr Pro Glu Gly Arg lie Pro Leu Lys Asn lie Tyr Arg Leu Phe 
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<221> misc feature 
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orf3 :2000MAY19 
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sapiens 
















<220> 






















<221> misc_feature 
















<223> Incyte ID 
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LG: 898771.1 


.orf 2:2000MAY19 




<400> 236 
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Ser Lys Gly Arg 


Val 


Gly 


Ala Ala Gly Arg 










125 




130 




135 


His 


Val 


Tip 


Arg 


Arg 


Gin Pro Gin Gly 


Leu 


Ser 


Pro Pro Gly Ala 










140 




145 




150 


Val 


Val 


His 


Leu 


Val 


Thr Gin Asp Arg 


Ala 


lie 


Val Thr Arg Arg 










155 




160 




165 


Gly 


Arg 


His 


Arg 


Gin 


Pro Arg Ala Cys 


Gly 


Arg 


Val Leu Glu Val 










170 




175 




180 


Val 


Ser 


Ala 


His 


Arg 


Glu Trp Ser Arg 


Ser 


Trp 


Arg 










185 


190 







<210> 237 

<211> 61 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_ feature 

<223> Incyte ID No: LI : 257664 . 67 . orf3 :2000MAY01 
<400> 237 

Ala His Ser Lys Pro Glu Lys lie Val Asn Lys Pro Asn Lys His 

1 5 10 15 

Glu Asp His Thr Gly Lys Leu Arg Pro Glu Thr Arg Glu Glu Asn 

20 25 30 

Lys Asn His Leu Lys Asp His Gin Pro Tyr Trp His Thr Phe Val 

35 40 45 

Asn Asn Thr Gin Phe Pro Asp lie Trp Glu Gin Val Lys Cys Val 

50 55 60 

Thr 



<210> 238 

<211> 335 1 ' 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 001496 .2 .orf 2 :2000MAY01 
<400> 238 



Arg 


Cys Gly Ala Ala 


Ala Ser Ala Gly 


Arg 


Glu 


Ser Ala Ala Gly 


1 


5 




10 




15 


Ser 


Glu Glu Gin Ala 


Gly Leu Arg Pro 


Ser 


Gin 


Leu Arg Gly Pro 




20 




25 




30 


Pro 


Asp Pro Pro Thr 


Glu Thr Ala Ala 


Val 


Ser 


Gly Gin Ala Val 




35 




40 




45 


Gly 


Ala Ala Trp Pro 


Ala Ala Gly Lys 


Met 


Phe 


Ser Val Glu Ser 


50 


55 




60 


Leu 


Glu Arg Ala Glu 


Leu Cys Glu Ser 


Leu 


Leu 


Thr Trp He Gin 




65 




70 




75 


Thr 


Phe Asn Val Gly 


Cys Thr .Met Pro 


Glu 


Pro 


Val Glu Asp Leu 




, 80 




85 




90 


Thr 


Asn Gly Val Val 


Met Ala Gin Val 


Leu 


Gin 


Lys He Asp Pro 




95 




100 




105 


Ala 


Tyr Phe Asp Glu 


Asn Trp Leu Asn 


Arg 


He 


Lys Thr Glu Val 




110 




115 




120 


Gly 


Asp Asn Trp Arg 


Leu Lys lie Ser 


Asn 


Leu 


Lys Lys He Leu 




125 




130 




135 


Lys 


Gly lie Leu Asp 


Tyr Asn His Glu 


He 


Leu 


Gly Gin Gin He 




140 




145 




150 


Asn 


Asp Phe Thr Leu 


Pro Asp Val Asn 


Leu 


He 


Gly Glu His Ser 



120/228 



BNSDOC1D: <WO__0162927A2_I_> 
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155 160 ^R5 

Asp Ala Ala Glu Leu Gly Arg Met Leu Gin Leu He Leu Gly Cys 

170 175 180 

Ala Val Asn Cys Glu Gin Lys Gin Glu Tyr He Gin Ala He Met 

185 190 195 

Met Met Glu Glu Ser Val Gin His Val Val Met Thr Ala He Gin 

200 , 205 210 

Glu Leu Met Ser Lys Glu Ser Pro Val Ser Ala Gly Asn Asp Ala 

215 220 225 

Tyr Val Asp Leu Asp Arg Gin Leu Lys Lys Thr Thr Glu Glu Leu 

230 235 240 

Asn Glu Ala Leu Ser Ala Lys Glu Glu He Ala Gin' Arg Cys His 

245 ' 250 255 

Glu Leu Asp Met Gin Val Ala Ala Leu Gin Glu Glu Lys Ser Ser 

260 265 270 

Leu Leu Ala Glu Asn Gin Val Leu Met Glu Arg Leu Asn Gin Ser 

275 280 285 

Asp Ser He Glu Asp Pro Asn Ser Pro Ala Gly Arg Arg His Leu 

290 295 .300 

Gin Leu Gin Thr Gin Leu Glu Gin Leu Gin' Glu Glu Thr Phe Arg 

305 310 315 

Leu Glu Ala Ala Lys Asp Asp Tyr Arg He Arg Cys Glu Glu Leu 

320 325 330 

Glu Lys Gly Asp Leu 

335 

<210> 239 
<211> 346 
<212> PRT 
<213> Homo sapiens 



<220> 

<221> misc_f eature 

<223> incyte ID No: LI: 1085273. 

<400> 239 



Ala 


Arg 


Ala 


Gly Ser 


Pro 


Pro 


Arg 


1 








5 








Ala 


His 


Cys 


Ser 


Arg 
20 


Ala 


Cys 


Ala 


Ala 


Cys 


Arg 


Thr 


Leu 


Thr 


Ala 


Ser 






35 








Ser 


Ser 


Leu 


Pro 


Thr 
50 


Pro 


Leu 


Ala 


Pro 


Pro 


Thr 


Pro 


Ala 
65 


Arg 


Pro 


Arg 


Arg 


Arg 


Gly 


Pro 


Arg 
80 


Trp 


Ala 


Cys 


Arg 


Thr 


Pro 


Arg 


Leu 
95 


Arg 


Arg 


Gly 


Arg 


Pro 


Pro 


Ala 


Gly 
110 


Ser 


Pro 


Ala 


Gly 


Leu 


Asn 


Phe 


Gly Asp 


Ala 


Ala 










125 








Leu 


Ser 


Ala 


Leu 


Ser 
140 


Ala 


Pro 


Gly 


Ala 


Gly 


Pro 


Gly , Ala 


Trp 


Ala 


Pro 










155 








Pro 


Ser 


Pro 


Asp Gly Pro 


Trp 


Cys 










170 








Gly 


Ala 


Gly 


Gly Val 


Leu 


Phe 


Ala 










185 








Pro 


Gly 


Pro 


Gly Gly Gly 


Ser 


Asp 










200 








Arg 


Ala 


Glu 


Pro 


Arg 
215 


Asp 


Ala 


Arg 


Ala 


Pro 


Glu 


Thr 


Gin 


Phe 


Lys 


Arg 



.orfl:2000MAY01 



Pro 


Pro 


Arg 


Pro Arg Arg Pro 




10 




15 


Ala 


Cys 


Thr 


Ser Pro Arg Thr 




25 




30 


Ser 


Ala 


Pro 


Ser Pro Trp Thr 




40 




45 


Gly 


Gly 


Pro 


Thr Ala Pro Gly 




55 




60 


Ser 


Ser 


Ala 


Ser Trp Thr Ala 




70 




75 


Pro 


Arg 


Pro 


Ala Arg Thr Ala 




85 




90 


Pro 


Arg 


Pro 


Arg Arg Arg Pro 




100 




105 


Arg 


Ser 


Pro 


Ala His Ser Leu 


115 




120 


Arg 


Gin 


Thr 


Pro Arg His Gly 




130 




135 


Leu 


Pro 


Gly 


Pro Gly Gin Pro 




145 




150 


Pro 


Leu 


Asp 


Ser Pro Gly Thr 




160 




165 


Phe 


Ser 


Pro 


Glu Gly Ala Gin 




175 




180 


Pro 


Phe 


Gly 


Arg Ala Gly Ala 




190 




195 


Leu 


Arg 


Arg 


Arg Glu Ala Ala 




205 




210 


Thr 


Gly 


Trp 


Pro Glu Glu Pro 




220 




225 


Arg 


Ser 


Cys 


Gin Met Glu Phe 



121/228 

1 
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• 



*0 235 240 

Glu Glu Gly Met Val Glu Gly Arg Ala Arg Gly Glu Glu Leu Ala 
245 250 255 

Ala Leu Gly Lys Gin Ala Ser Phe Ser Gly Asn Val Glu Val lie' 
260 265 270 

Gin Val Val Ser Asp Pro Ser Ala Alia Phe Gly Pro Ala Ala Arg 
275 280 285 

Ser Gin Ala Arg Asn Lys Cys lie Leu Tyr lie Met Gin Arg Lys 
290 295 300 

Val Asn Gly Leu Thr Gly Asn Phe Asn Pro Arg Ser Lys lie lie 
305 310 315 

Ser lie Phe Tyr Leu Phe Lys Leu Phe He Leu Ala Met Asp Leu 
320 325 ( 330 

Ala Thr Val Arg Val Val Leu Glu Leu Pro Phe Leu Leu Ser Gly 
335 340 345 

He 



<210> 240 
<211> 298 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID NO: LI:333138 .2 .orf 3 :2000MAY01 
<400> 240 



Ala 


Thr Trp Ala 


Phe 


He Ser Ala Pro 


Val 


Pro 


Val Phe Pro Asp 


1 




5 




10 




15 




Phe Gly He 


Lys 


riJLcl ofcsX ocJ. ulU 


Ala 


OcL 


1 XIX. LicU V7J.ll nla 




20 




25 




30 


Met 


Gly Arg Lys Glu 


Glu Asp Asp Cys 


Ser 


Ser 


Trp Lys Lys Gin 






35 




40 




45 


Thr 


Thr Asn He Arg 


Lys Thr Phe He 


Phe 


Met 


Glu Val Leu Gly 






50 


55 




60 


Ser 


Gly Ala Phe 


Ser 


Glu Val Phe Leu 


Val 


Lys 


Gin Arg Leu Thr 






65 




70 




75 


Gly 


Lys Leu Phe 


Ala 


Leu Lys Cys He 


Lys 


Lys 


Ser Pro Ala Phe 






80 




85 




90 


Arg 


Asp Ser Ser 


Leu 


Glu Asn Glu He 


Ala 


Val 


Leu Lys Lys He 






95 




100 




105 


Lys 


His Glu Asn 


He 


Val Thr Leu Glu 


Asp 


He 


Tyr Glu Ser Thr 




110 




115 




120 


Thr 


His Tyr Tyr 


Leu 


Val Met Gin Leu 


Val 


Ser 


Gly Gly Glu Leu 






125 




130 




135 


Phe 


Asp Arg He 


Leu 


Glu Arg Gly Val 


Tyr 


Thr 


Glu Lys Asp Ala 






140 




145 




150 


Ser 


Leu Val He 


Gin 


Gin Val Leu Ser 


Ala 


Val 


Lys' Tyr Leu His 






155 




160 




165 


Glu 


Asn Gly He 


Val 


His Arg Asp Leu 


Lys 


Pro 


Glu Asn Leu Leu 






170 




175 




180 


Tyr 


Leu Thr Pro 


Glu 


Glu Asn Ser Lys 


He 


Met 


He Thr Asp Phe 






185 




190 




195 


Gly 


Leu Ser Lys 


Met 


Glu Gin Asn Gly 


He 


Met 


Ser Thr Ala Cys 






200 




205 




210 


Gly 


Thr Pro Gly Tyr 


Val Ala Pro Glu 


Val 


Leu 


Ala Gin Lys Pro 






215 




220 




225 


Tyr 


Ser Lys Ala Val 


Asp Cys Trp Ser 


He 


Gly 


Val He Thr Tyr 






230 




235 


240 


He 


Leu Leu Cys 


Gly 


Tyr Pro Pro Phe 


Tyr 


Glu 


Glu Thr Glu Ser 






245 


250 




255 


Lys 


Leu Phe Glu Lys 


He Lys Glu Gly 


Tyr 


Tyr 


Glu Phe Glu Ser 






260 




265 




270 


Pro 


Phe Trp Asp Asp 


He Ser Glu Ser 


Ala 


Lys 


Asp Phe He Cys 






275 




280 




285 


His 


Leu Leu Glu 


Lys 


Asp Pro Asn Glu 


Gly 


Val 


Thr Leu 
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290 295 

<210> 241 
<211> 133 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc__f eature 

<223> Incyte ID No: LI : 33 8927 . 1 . orf 1 : 2000MAY01 
<400> 241 



Asp 


Pro 


Pro 


Arg 


Glu 


Thr Gly Arg Met 


Lys 


Gly 


Ala Ser Glu 


Glu 


1 








5 




10 






15 


Lys 


Leu 


Ala 


Ser 


Val 


Ser Asn Leu Val 


Thr 


Val 


Phe Glu Asn 


Ser 








20 




25 






30 


Arg 


Thr 


Pro 


Glu 


Ala 


Ala Pro Arg Gly 


Gin 


Arg 


Leu Glu Asp 


Val 








35 




• 40 






45 


His 


His 


Arg 


Pro 


Glu 


Cys Arg Pro Pro 


Glu 


Ser 


Pro Gly Pro 


Arg 








50 


55 






60 


Glu 


Lys 


Thr 


Asn 


Val* 


Gly Glu Ala Val 


Gly 


Ser 


Glu Pro Arg 


Thr 








65 


70 






75 


Val 


Ser 


Arg 


Arg 


Tyr 
80 


Leu Asn Ser Leu 


Lys 
85 


Asn 


Lys Leu Ser 


Ser 
90 


Glu 


Ala 


Trp 


Arg 


Lys 
95 


Ser Leu Pro Ala 


Cys 
100 


Asp 


Pro Leu Arg 


He 
105 


Gly 


Asp 


Ala 


Gly 


Ala 
110 


Arg Glu Glu Asp 


Arg 
115 


Pro 


Gly Ala Ala 


Trp 
120 


Arg 


His 


Asp 


Ala 


Gly 
125 


His Met Trp Arg 


Ala 
130 


Ser 


Thr Cys 





<210> 242 
<211> 354 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: LG : 33 5558 . 1 . orf 2 : 200 0FEB18 
<220> 

<221> unsure 
<222> 341 

<223> unJcnown or other 
<400> 242 



Leu 


He Gly val 


Leu 


Gin 


Val 


Leu 


Gin 


Val 


Glu 


Leu 1 Gly He Asn 


1 






5 










10 






15 


Ser 


Val Thr 


Gly 


Thr 


Ser 


Thr 


Val 


Asn 


Asn 


Val 


Asn He Thr 


Ala 






20 










25 






30 


Val 


Gly Ser 


Phe 


Asn 


Pro 


Asn 


Val 


Thr 


Ser 


Ser 


Met Leu Gly Asn 






35 










40 






45 


Val 


Asn He 


Ser 


Thr 


Ser 


Asn 


He 


Pro 


Ser 


Ala 


Ala Gly Val 


Ser 








50 










55 




60 


Val 


Gly Pro 


Gly 


Val 


Thr 


Ser Gly Val 


Asn 


Val 


Asn He Leu 


Ser 




65 










70 






75 


Gly 


Met Gly Asn 


Gly Thr 


He 


Ser 


Ser 


Ser 


Ala 


Ala Val Ser 


Ser 






80 










85 






90 


Val 


Pro Asn 


Ala 


Ala Ala Gly Met Thr 


Gly 


Gly 


Ser Val Ser 


Ser 








95 










100 




105 


Gin 


Gin Gin 


Gin 


Pro 


Thr 


Val 


Asn 


Thr 


Ser 


Arg 


Phe Arg Val 


Val 








110 










115 


120 


Lys 


Leu Asp 


Ser 


Ser 


Ser 


Glu 


Pro 


Phe 


Lys 


Lys 


Gly Arg Trp 


Thr 








125 










13 0 






135 


Cys 


Thr Glu 


Phe 


Tyr 


Glu 


Lys 


Glu 


Asn 


Ala 


Val 


Pro Ala Thr 


Glu 








140 








145 






150 


Gly 


Val Leu 


He 


Asn 


Lys 


Val 


Val 


Glu 


Thr 


Val 


Lys Gin Asn 


Pro 



123/228 
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L55 160 165 

He Glu Val Thr Ser Glu Arg Glu Ser Thr Ser Gly Ser Ser Val 
.170 175 180 

Ser Ser Ser Val Ser. Thr Leu Ser His Tyx Thr Glu Ser Val Gly 
185 190 195 

Ser Gly Glu Met Gly Ala Pro Thr Val Val Val Gin Gin Gin Gin 
200 205 210 

Gin Gin Gin Arg Leu Leu Gin Gin Glri.Pro Ala Leu Gin Gly Val 
' , 215 220 225 

Thr Leu Gin Gin- Met Asp Phe Gly Ser Thr Gly Pro Gin Ser He 
230 235 240 

Pro Ala Val Ser He Pro Gin Ser He Ser Gin Ser Gin He Ser 
245 250 . ' 255 

Gin Val Gin Leu Gin Ser Gin Glu Leu Ser Tyr Gin Gin Lys Gin 
260 265 270 

Gly Leu Gin Pro Val Pro Leu Gin Ala ' Thr Met Ser Ala Ala Thr 
275 280 285 

Gly He Gin Pro Ser Pro Val Asn Val Val Gly Val Thr Ser Ala 
290 295 300 

Leu Gly Gin Gin Pro Ser He Ser Ser Leu Ala Gin Pro Gin Leu 
305 310' 315 

Pro Tyr Ser Gin Ala Ala Pro Pro Val Gin Thr Pro Leu Pro Gly 
320 325 330 

Ala Pro Pro Pro Gin Gin Leu Gin' Tyr Gly Xaa Gin Gin Pro Met 
335 . 340 • 345 

Val Ser Thr Gin Met Ala Pro Gly Met 
350 

<210> 243 
<211> 237 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: LG: 998283 . 7 .orfl :2000FEB18 
<400> 243 ' .' . 

Leu Tyr Cys He Cys Lys Thr Pro, Tyr Asp Glu Ser Lys Phe Tyr 
1 5 10 15 

He Gly Cys Asp Leu Cys Thr Asn Trp Tyr His Gly Glu Cys Val 
20 25 30 

Gly He Thr Glu Lys Glu Ala Lys Lys Met Asp Val Tyr He Cys 
35 40 45 

Asn Asp Cys Lys Arg Ala Gin Glu Gly Ser Ser Glu Glu Leu Tyr 
50 55 60 

Cys He Cys Arg Thr Pro Tyr Asp Glu Ser Gin Phe Tyr He Gly 
65 70 75 

Cys Asp Arg Cys Gin Asn Trp Tyr His Gly Arg Cys Val Gly He 
80 85 90 

Leu Gin Ser Glu Ala Glu Leu He Asp Glu Tyr Val Cys Pro Gin 
95 100 105 

Cys Gin Ser Thr Glu Asp Ala Met Thr Val Leu Thr Pro Leu Thr 
110 115 120 

Glu Lys Asp Tyr Glu Gly Leu Lys Arg Val Leu Arg Ser Leu Gin 
125 130 135 

Ala His Lys Met Ala Trp Pro Phe Leu Glu Pro Val Asp Pro Asn 
140 145 150 

Asp Ala Pro Asp Tyr Tyr Gly Val He Lys Glu Pro Met Asp Leu 
155 160 165 

Ala Thr Met Glu Glu Arg Val Gin Arg Arg Tyx Tyr Glu Lys Leu 
170 175 180 

Thr Glu Phe Val Ala Asp Met Thr Lys He Phe Asp Asn Cys Arg 
185 190 195 

Tyr Tyr Asn Pro Ser Asp Ser Pro Phe Tyr Gin Cys Ala Glu Val 
200 205 210 

Leu Glu Ser Phe Phe Val Gin Lys Leu Lys Gly Phe Lys Ala Ser 



124/228 
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215 . 220 225 

Arg Ser His Asn Asn Lys Leu Gin Ser Thr Ala Ser 
23(j 235 

<210> 244 1 ' 

<211> 161 * 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 402739 - 1 .orfl :2000FEB01 
<400> 244 

Pro Pro Trp Gly Gin Arg Ser Pro Thr Pro Pro Ser Asp Thr Gly 
1 5 10 15 

Gly Thr Ser Arg Pro Arg Thr Met He Pro Pro Gly Glu Cys Thr 
20 25 30 

Tyr Ala Gly Arg Lys Arg Arg Arg Pro Leu, Gin Lys Gin Arg Pro 
35 40 45 

Ala Val Gly Ala Glu Lys Ser Asn. Pro Ser Lys Arg His Arg Asp 
50 ,55 60 

Arg Leu Asn Ala Glu Leu Asp His Leu Ala Ser Leu Leu Pro Phe 
65 70 75 

Pro Pro Asp He He Ser Lys Leu Asp Lys Leu Ser Val Leu Arg 
80 85 90 

Leu Ser Val Ser Tyr Leu Arg Val Lys Ser Phe Phe Gin Gly Gin 
95 100 105 

Gly Leu Ala Val Ala Asp Ala Glu Asp Val Asp Asp His Thr Gly 
110 115 120 

Glu Arg Arg Pro Met Ser Phe Arg Arg Pro Arg Ala Leu Asp Thr 
125 130 135 

Gin Ala Leu Arg Arg Thr Gin Phe Gly Leu His Leu Leu Met Val 
140 145 150 

Asn He Ala Gly Leu He Ala Thr Asp Arg Leu 
, 155 160 

<210> 245 
<211> 151 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 175223 . 1 .orf 3 : 2000FEB01 
<400> 245 



He 


Asp 


Ala 


Glu Asp 


His 


Ser Val Pro 


Lys 


Gly Lys 


Phe 


Ser Ser 


1 






5 






10 






15 


His 


Glu 


Phe 


Gly Ala 


Glu 


Gly Pro Trp 


Gly 


Asn Met 


Ala 


Glu Gly 








20 




25 






30 


Gly 


Ala 


Ser 


Lys Gly 


Gly 


Gly Glu Glu 


Pro 


Gly Lys 


Leu 


Pro Glu 








35 






40 






45 


Pro 


Ala 


Glu 


Glu Glu 


ser 


Gin Val Leu 


Arg 


Gly Thr 


Gly 


His Cys 








50 






55 






60 


Lys 


Trp 


Phe 


Asn Val 


Arg 


Met Gly Phe 


Gly 


Phe He 


Ser 


Met He 








65 






70 






75 


Asn 


Arg 


Glu 


Gly Ser 


Pro 


Leu Asp He 


Pro 


Val Asp 


Val 


Phe Val 






80 






85 






90 


His 


Gin 


Ser 


Lys Leu 


Phe 


Met Glu Gly 


Phe 


Arg Ser 


Leu 


Lys Glu 








95 






100 






105 


Gly 


Glu 


Pro 


Val Glu 


Phe 


Thr Phe Lys 


Lys 


Ser Ser 


Lys 


Gly Leu 






110 




115 






120 


Glu 


Ser 


He 


Arg Val 


Thr 


Gly Pro Gly 


Gly 


Ser Pro 


Cys 


Leu Gly 








125 




130 






135 


Ser 


Glu 


Arg 


Arg Pro 


Lys 


Gly Lys Thr 


Leu 


<31n Lys 


Arg 


Lys Pro 








140 


145 






150 



125/228 
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<210> 246 
<211> 160 

<212> PRT 1 
<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LG: 981076 . 2 . orf 2 :2000MAY19 
<220> 

<221> unsure 
<222> 157 

<223> unknown or other 
<400> 246 



Met 


Ala 


Ser 


Lys 


Val 


Thr Asp 


Ala 


He 


Val 


Trp 


Tyr Gin Lys Lys 


1 








5 








10 




15 


He 


Gly 


Ala 


Tyr Asp 


Gin Gin 


He 


Trp 


Glu 


Lys 


Ser Val Glu Gin 










20 








25 




30 


Arg 


Glu 


He 


Lys 


Gly 


Leu Arg 


Asn 


Lys 


Pro 


Lys 


Lys Thr Ala His 










35 






40 




45 


Val 


Lys 


Pro 


Asp 


Leu 


He Asp 


Val 


Asp 


Leu 


Val 


Arg Gly Ser Ala 










50 








55 




60 


Phe 


Ala 


Lys 


Ala 


Lys 


Pro Glu 


Ser 


Pro 


Trp 


Thr 


Ser Leu Thr Arg 










65 








70 




75 


Lys 


Gly 


He 


Val 


Arg 


Val Val 


Phe 


Phe 


Pro 


Phe 


Phe Phe Arg Trp 










80 








85 




90 


Trp 


Leu 


Gin 


Val 


Thr 


Ser Lys 


Val 


He 


Phe 


Phe 


Trp Leu Leu Val 










95 








100 




105 


Leu 


Tyr 


Leu 


Leu 


Gin 


Val Ala 


Val 


He 


Val 


Leu 


Phe Cys Ser Thr 










110 








115 




120 


Ser 


Ser 


Pro 


His 


Ser 


He Pro 


Leu 


Thr 


Glu 


Val 


He Gly Pro He 










125 








130 




135 


Trp 


Leu 


Met 


Leu 


Leu 


Leu Gly 


Thr 


Val 


His 


Cys 


Gin He Val Ser 










140 






145 


150 


Thr 


Arg 


Thr 


Pro 


Lys 


Pro Xaa 


Leu 


Ser 


Thr 







155 160 

<210> 247 
<211> 160 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 1008973 . 1 .orf3 :2000MAY01' 
<400> 247 



Leu 


His He 


Pro Arg 


Ser 


Pro 


Pro 


Gly 


Asp 


Arg Ala Ala Arg Thr 


1 




5 










10 


15 


Gly 


His Pro 


Arg Leu 


Pro 


Val 


Pro 


Pro 


Pro 


Arg Ala Arg Thr Glu 






20 










25 


30 


Pro 


Arg Pro 


Arg Gly 


Gin 


Arg 


Arg 


Leu 


His 


Ser Ser Gly Glu Met 






35 










40 


45 


Ala 


Ala Gly 


Ser Thr 


Thr 


Leu 


Arg 


Ala 


Val 


Gly Lys Leu Gin Val 






50 








55 


60 


Arg 


Leu Ala 


Thr Lys 


Thr 


Glu 


Pro 


Lys 


Lys 


Leu Glu Lys Tyr Leu 






65 










70 


75 


Gin 


Lys Leu 


Ser Ala 


Leu 


Pro 


Met 


Thr 


Ala 


Asp He Leu Ala Glu 






80 










85 


90 


Thr 


Gly Leu 


Arg Lys 


Thr 


Val 


Lys 


Arg 


Leu 


Arg Lys His Gin His 






95 










100 


105 


Val 


Gly Asp 


Phe Ala 


Arg 


Asp 


Leu 


Ala 


Ala 


Arg Trp Lys Lys Leu 






110 










115 


120 
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Val Leu Val Asp Arg Asn Thr Gly Pro Asp Pro Gin Asp Pro Glu 
125, 130 135 

Glu Ser Ala Ser Arg Gin Arg Tyr Gly Glu Ala Leu Gin Glu Arg 
140 145 150 

Glu Lys Gly Trp Gly Leu Pro Arg Lys Arg 
155 160 
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<400> 248 



Arg 


Ala 


His 


Ser 


Gly 


Val 


Leu 


Met Ser 


Ala 


Met 


Leu Ser His 


Gly 


1 








5 








10 • 






15 


Val 


Leu 


Lys 


Arg 


Ala 


Ser 


GlU 


Arg Gly 


Ala 


Glu 


Arg His Ser 


Leu 








20 








25 






30 


Pro 


Pro 


Ser 


Arg 


Leu 


Val 


Leu 


Val Pro 


Gly 


Arg 


Arg Ala Leu Arg 








35 








40 






45 


Ser 


Ala 


Pro 


Gin 


Val 


Pro 


Gly 


Ser Gly 


Trp 


Arg 


V^.1 Gly Thr 


Glu 










50 






55 






60 


Pro 


Pro 


Val 


Leu 


His 


Asp 


Pro 


Ala Gly 


Arg 


Gly 


Arg Phe Pro 


Gin 










65 






70 




75 


Ser 


Gly 


Glu 


Val 


Ser 


Ala 


Ala 


Pro Glu 


Met 


Ser 


Lys Leu Ser 


Phe 








80 








85 




90 


Arg 


Ala 


Arg 


Ala 


Leu 


Asp 


Ala 


Ser Lys 


Pro 


Leu 


Pro Val Phe 


Arg 








95 








100 






105 


cys 


GlU 


Asp 


Leu 


Pro 
110 


Asp 


Leu 


His Glu 


Tyr 
115 


Ala 


Ser He Asn 


Arg 
120 


Ala 


Val 


Pro 


Gin 


Met 


Pro 


Thr 


Gly He 


Glu 


Lys 


Glu Glu Glu 


Ser 










125 






13 0 






135 


Glu 


His 


His 


Leu Pro 


Ala 


Gly 


Leu Phe 


Gin 


His 


Ser Arg Cys 


Met 










140 






145 






150 


Ala 


Arg 


Arg 


Gly 


He 


Tyr 


Met 


Val lie 


Pro 


Val 


Pro Glu Ala 


Glu 






155 








160 






165 


Ser 


Asn 


Tyr 


Cys 


Leu 
170 


Leu 
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Pro 


Gly 


Met 


Ser 


Val 


Ala 


Gly 


Val 


Glu 


Gly 


Glu 


Pro Leu Val Ser 


1 






5 








10 




15 


Ser 


Gin 


Ser 


Gly 


Gin 


Ser 


Pro 


Pro 


Glu 


Pro 


Gin 


Asp Pro Glu Ala 








20 










25 




30 


Pro 


Ser 


Ser 


Ser 


Gly 


Pro 


Gly 


His 


Leu 


Val 


Ala 


Met Gly Lys Val 










35 








40 




45 


Ser 


Arg 


Thr 


Pro 


Val 


Glu 


Ala 


Gly 


Val 


Ser 


Gin 


Ser Asp Ala Glu 










50 










55 




. 60 


Asn 


Ala 


Ala 


Pro 


Ser 


Cys 


Pro 


Asp 


Glu 


His 


Asp 


Thr Leu Pro Arg 










65 








70 




75 


Arg 


Arg 


Gly 


Arg 


Pro 


Ser 


Arg 


Arg 


Phe 


Leu 


Gly 


Lys Lys Tyr Arg 










80 










85 




90 


Lys 


Tyr 


Tyr 


Tyr 


Lys 


Ser 


Pro 


Lys 


Pro 


Leu 


Leu 


Arg Pro Phe Leu 










95 








100 




105 


Cys 


Arg 


He 


Cys 


Gly 


Ser 


Arg 


Phe 


Leu 


Ser 


His 


Glu Asp Leu Arg 
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Tio 








115 








120 


Phe 


His 


Val Asn 


Ser 


His 


GlU 


Ala Gly Asp 


Pro 


Gin 


Leu Phe 


Lys 








125 








130 








135, 


cys 


Leu 


Gin Cys 


Ser 


Tyr Arg 


Ser Arg Arg 


Trp 


Ser 


Ser Leu 


Lys 








140 








145 








150 


Glu 


His 


Met Phe 


Asn 


His 


Val 


Gly Ser 


Lys 


Pro 


Tyr 


Lys Cys 


Asp 




• 




155 








160 








165 


Glu 


Cys 


Ser Tyx 


Thr 


Ser 


Val 


Tyr Arg Lys 


Asp 


Val 


He Arg 


His 








170 








175 








180 


Ala 


Ala 


Val His 


Ser 


Arg Asp 


Arg Lys 


Lys 


Arg 


Pro 


Asp Pro 


Thr 








185 








190 








195 


Pro 


Lys 


Leu Ser 


Ser 


Phe 


Pro 


Cys Pro 


Val 


Cys 


Gly 


Arg Val 


Tyr 








200 








205 


• 






210 


Pro 


Met 


Gin Lys 


Arg 


Leu 


Thr 


Gin His 


Met 


Lys 


Thr 


His Ser 


Thr 








215 








220 








225 


Glu 


Lys 


Pro His 


Met 


Cys Asp 


Lys Cys 


Gly 


Lys 


Ser 


Phe Lys 


Lys 








230 








235 






Gin Ala 


240 


Arg 


Tyr 


Thr Phe 


Lys 


Met His 


Leu Leu 


Thr 


His 


He 


Val 








245 








250 








255 


Ala 


Asn 


Arg Arg 


Phe 


Lys 


Cys 


Glu Phe Cys 


Glu 


Phe 


Val Cys 


Glu 








260 






265 








270 


Asp 


Lys 


Lys Ala 


Leu 


Leu Asn 


His Gin Leu 


Ser 


His 


Val Ser 


Asp 








275 








280 








285 


Lys 


Pro 


Phe Lys 


Cys 


Ser Phe 


Cys Pro Tyx 


Arg 


Thr 


Phe Arg 


Glu 








290 








295 








300 


Asp 


Phe 


Leu Leu 


Ser 


His 


Val 


Ala Val 


Lys 


His 


Thr 


Gly Ala 


Lys 








305 








310 






315 


Pro 


Phe 


Ala Cys 


Glu 


Tyr Cys 


His Phe 


Ser 


Thr 


Arg 


His Lys 


Lys 








320 








325 








330 


Asn 


Leu 


Arg Leu 


His 


Val 


Arg 


Cys Arg 


His 


Ala 


Ser 


Ser Phe 


Glu 


• 






335 






340 








345 


Glu 


Trp 


Gly Arg 


Arg 


His 


Pro 


Glu Glu 


Pro 


Pro 


Ser 


Arg Arg 


Arg 








350 








355 








360 


Pro 


Phe 


Phe Ser 


Leu 


Gin 


Gin 


lie Glu 


Glu 


Leu 


Lys 


Gin Gin 


His 








3 65 








370 






375 


Ser 


Ala 


Ala Pro 


Gly 


Pro 


Pro 


Pro Ser 


Ser 


Pro 


Gly 


Pro Pro 


Glu 








380 








385 








390 


lie 


Pro 


Pro Glu 


Ala 


Thr 


Thr 


Phe Gin 


Ser 


Ser 


Glu 


Ala Pro 


Ser 








395 








400 








405 


Leu 


Leu 


Cys Ser 


Asp 


Thr 


Leu 


Gly Gly Ala 


Thr 


He 


He Tyr 


Gin 








410 








415 








420 


Gin 


Gly 


Ala Glu 


Glu 


Ser 


Thr 


Ala Met 


Ala 


Thr 


Gin 


Thr Ala 


Leu 








425 








430 








435 


Asp 


Leu 


Leu Leu 


Asn 


Met 


Ser 


Ala Gin Arg 


GlU 


Leu 


Gly Gly 










440 








445 
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<223> Incyte ID No: LG:475404 . 1 . orf 2 :2000FEB18 
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Leu 


Thr 


Pro 


Gly 


His 


Pro Gly Ser 


Arg Gly Met 


Asp 


Ser Val 


Ala 


1 








5 




10 






15 


Phe 


GlU 


Asp 


Val 


Ala 


Val Asn Phe 


Thr Gin Glu 


Glu 


Trp Ala 


Leu 










20 




25 






30 


Leu Asp 


Ser 


Ser 


Gin 


Lys Asn Leu 


Tyr Arg Glu 


Val 


Met Gin 


Glu 










35 




40 






45 


Thr 


Cys 


Arg 


Asn 


Leu 


Ala Ser Val 


Gly Ser Gin 


Trp 


Lys Asp 


Gin 










50 




55 




60 


Asn 


He 


Glu 


Asp 


His 


Phe Glu Lys 


Pro Gly Lys 


Asp 


He Arg 


Asn 










65 




70 






75 


His 


He 


Val 


Gin 


Arg 


Leu Cys Glu 


Ser Lys Glu 


Asp 


Gly Gin 


Tyr 
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80 

Gly Glu Val Val Ser 
95 

lie Ser Thr Gly Leu 
110 

Val Phe Val Arg His 
125 

<210> 251 
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<220> 
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<220> 

<221> unsure 
<222> 136 

<223> unknown or other 
<400> 251 



Asn 


Ser 


Leu Ser 


Val 


Ala 


Ser 


Ala Pro 


Pro 


Gin 


Arg Asp Pro 


Gly 


-1 






5 








10 






15 


Met 


Ala 


Met Ala 


Leu 


Pro 


Met 


Pro Gly 


Pro 


Gin 


Glu Ala Val 


Val 








20 






25 






30 


Phe 


Glu 


Asp Val 


Ala 


Val 


Tyr 


Phe Thr 


Arg 


He 


Glu Trp Ser 


Cys 






35 






40 






45 


Leu 


Ala 


Pro Asp 


Gin 


Gin 


Ala 


Leu Tyr 


Arg 


Asp 


Val Met Leii 


Glu 






50 






55 






60 


Asn 


Tyr 


Gly Asn 


Leu 


Ala 


Ser 


Leu Gly 


Phe 


Leu 


Val Ala Lys 


Pro 




65 








70 






75 


Ala 


Leu 


lie Ser 


Leu 


Leu 


Glu 


Gin Gly 


Glu 


Glu 


Pro Gly Ala 


Leu 








80 






85 






90 


He 


Leu 


Gin Val 


Ala 


GlU 


Gin 


Ser Val 


Ala 


Lys 


Ala Ser Leu 


Cys 








95 








100 






105 


Thr 


Glu 


Asp Pro 


Asn 


Thr 


Leu 


Pro Ser 


Arg 


Ser 


Gin Glu Gly 


Ser 






110 








115 






120 


Pro 


Ala 


Ser Ser 


Glu 


Gly 


Gly 


Pro Gly 


GlU 


Lys 


Gly Val Ala 


Gly 








125 




130 






135 


Xaa 


Val 


Ala Gly 


Gly 


Gly 


Ala 


Ala Ser 


Ser 


Trp 


Pro His Gly 


Glu 






140 






145 






150 


His 


Pro 


Val Thr 


Pro 


Asn 


Arg 


















155 
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Asp 


Thr 


Met 


Gin 


Ala 


Val 


Val 


Pro 


Leu 


Asn 


Lys 


Met 


Thr Ala 


He 


1 








5 










10 








15 


Ser 


Pro 


GlU 


Pro 


Gin 


Thr 


Leu 


Ala 


Ser 


Thr 


GlU 


Gin 


Asn Glu 


Val 










20 










25 








30 


Pro 


Arg 


Val 


Val 


Thr 


Ser 


Gly 


Glu 


Gin 


Glu 


Ala 


He Leu Arg Gly 










35 








40 








45 


Asn 


Ala 


Ala 


Asp 


Ala 


Glu 


Ser 


Phe 


Arg 


Gin 


Arg 


Phe 


Arg Trp 


Phe 








50 










55 








60 


Cys 


Tyr 


Ser 


GlU 


Val 


Ala 


Gly 


Pro 


Arg 


Lys 


Ala 


Leu 


Ser Gin 


Leu 








65 








70 








75 


Trp 


Glu 


Leu 


cys 


Asn 


Gin 


Trp 


Leu 


Arg 


Pro 


Asp 


lie 


His Thr 


Lys 



PCT/US01/06059 



85 ^90 

Gin He Pro Asn Leu Asp Leu Asn Glu Asn, 
100 105 
Lys Pro Cys Glu Cys Ser He Cys Gly Lys 
115 120 

Ser Leu 
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^80 






85 


90 


Glu 


Gin 


lie 


Leu Glu. Leu Leu 


yal Phe 


Glu Gin 


Phe Leu Thr He 








95 




100 


105 


Leu 


Pro 


Gly 


Glu lie Arg lie 


Trp Val 


Lys Ser 


Gin His Pro Glu 






110 




115 


120 


Ser 


Ser 


Glu 


Glu Val 


Val Thr 


Leu lie 


Glu Asp 


Leu Thr Gin Met 








125 




Ser Gin, 


130 


135 


Leu 


Glu 


Glu 


Lys Asp 
, 140 


Pro Val 


Asp Ser 
145 


Thr Val Ser Gin 
150 


Glu 


Glu 


Asn 


Ser 1 Lys 


Glu Asp 


Lys Met 


Val Thr 


Val Cys Pro Asn 








155 




160 


165 


Thr 


Glu 


Ser 


Cys Glu 
170 


Ser lie 


Thr Leu 


Lys Asp 
175 


Val Ala Val Asn 

; ' 180 


Phe 


Ser 


Arg 


Gly Glu Trp Lys 


Lys Leu 


Glu Pro 


Phe Gin Lys Glu 








185 






190 


195 


Leu 


Tyr 


Lys 


Glu Val 


Leu Leu 


Glu Asn' 


Leu Arg 


Asn Leu Glu Phe 




200 






205 


210 


Leu 


Asp 


Phe 


Pro Val 


Ser Lys 


Leu Glu 


Leu He 


Ser Gin Leu Lys 






215 




220 


225 


Trp 


Val 


Glu Leu Pro 


Trp Leu 


Leu Glu 


Glu Val 


Ser Lys Ser Ser 






230 




235' 


240 


Arg 


Leu 


Asp 


Glu Ser 


Ala Leu 


Asp Lys 


He He 


Glu Arg Cys Leu 






245 




250 


255 


Arg 


Asp 


Asp 


Asp His 
2'60 


Gly Leu 


Met' Glu 


Glu Ser 
265 


Gin Gin Tyr Cys 
270 


Gly 


Ser 


Ser 


Glu Glu 
275 


Asp His 


Gly Asn 


Gin Gly 
280 


Asn Ser Lys Gly 
285 


Arg 


Val 


Ala 


Gin Tyr 
290 


Lys Thr 


Leu Gly 


Ser Gly 
295 


Ser Arg Gly Lys 
300 


Lys 


Phe 


Asp 


Pro Asp 
305 
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Glu 


Cys 


Ser 


Arg Val Thr 


Val 


Thr 


Glu 


His 


Trp Ser Lys Val Phe 


1 




5 








10 


15 


Pro 


Lys 


Gly 


Gin Gly Ser 


Gin 


Glu 


His 


Leu 


Leu Lys Leu Met Thr 




20 








25 


30 


Met 


Gly 


Asp 


Met Lys Thr 


Pro 


Asp 


Phe 


Asp 


Asp Leu Leu Ala Ala 








35 








40 


45 


Phe 


Asp 


He 


Pro Asp Met 


Val 


Asp 


Pro 


Lys 


Ala Ala He Glu Ser 






50 








55 


60 


Gly 


His 


Asp 


Asp His Glu 


Ser 


His 


Met 


Lys 


Gin Asn Ala His Gly 




65 








70 


75 


Glu 


Asp 


Asp 


Ser His Ala 


Pro 


Ser 


Ser 


Ser 


Asp Val Gly Val Ser 




80 








85 


90 


Val 


He 


Val 


Lys Asn Val 


Arg 


Asn 


He 


Asp 


Ser Ser Glu Gly Gly 








95 








100 


105 


Glu 


Lys 


Asp 


Gly His Asn 


Pro 


Thr Gly 


Asn 


Gly Leu His Asn Gly 






,110 








115 


120 


Phe 


Leu 


Thr 


Ala' Ser Ser 


Leu 


Asp Ser 


Tyr 


Ser Lys Asp Gly Ala 








125 








130 


135 


Lys 


Ser 


Leu 


Lys Gly Asp 


Val 


Pro 


Ala 


Ser 


Glu Val Thr Leu Lys 






140 








145 


150 


Asp 


Ser 


Thr 


Phe Ser Gin 


Phe 


Ser 


Pro 


lie 


Ser Ser Ala Glu Glu 






155 








160 


165 


Phe 


Asp 


Asp 


Asp Glu' Lys 


He 


Glu Val 


Asp 


Asp Pro Pro Asp Lys 








170 








175 


180 


Glu 


Asp 


Met 


Arg Ser Ser 


Phe 


Arg 


Ser 


Asn 


Val Leu Thr Gly Ser 
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185 










190 










195 


Ala 


Pro 


Gin Gin Asp 




Asp 


Lys Leu Lys Ala Leu Gly Gly Glu 










200 

£t \J \J 








205 










210 


Asn 


Ser 


Ser 


Lys 


mi 


Glv 


Leu 




Thr 


Ser Gly Asn Val 


Glu 


Lys 








215 








220 










225 


Asn 


Lys 


Ala Val 




Arg 


Glu 


iiii 


Glu Ala Ser 


Ser 


He 


Asn 


Leu 






Glu- 


23 0 










235 










240 


Ser 


Val 


Tyr 


Pro 


Phe 


Lys 


Val 

VCLX 


**x y 


Lys 


Ala 


Glu- 


Asp 


Lys 


Leu 








245 










250 










255 


Lys 


Glu 


Ser 


Ser 


A cr> 
no £J 


Lys 


Val 


JJCU 


Glu 


Asn 


Arg 


Val 


Leu 


Asp 


Glv 








960 








265 










270 


Lys 


Leu. 


Ser 


Ser 


m ii 

ol u 


Lys 


Asn 


A cm 


Thr 


Q or 

OCl 




P>ro 


Ser Val 


Ala 








275 








£t O V 










285 


XT X !»/ 


Ser 


Lys 


Thr 


Lys 


Ser 


Ser 


Ser 


Lys ' 


T .on 


Opr 

t~> t=l 


Ser 


Cys 


He 


Ala 








290 










295 










300 


Ala 

Ala 


Tie 


Ala 


Ala 


Leu 


Ser 


Ala 


Lys 


Lys 


Ala 


Ala 


Ser 


Asp 


Ser 












305 








•Jin 

X \t 










J X -J 




gi ii 


Pro 


Val 


Ala 


A 


o tsx 


Arg 


Glu 


Qor 

OCX 


Ser 


Pro 


Leu 


Pro 


T A^C 

ii o 








320 










•IOC. 










330 




VCLX 


Asn Asp 


Ser 


Dva 

Jr X. \J 


A rrr 

•**i y 


Ala 


Ala 


Asp 


Lys 


ser 


Pro 


Glu 


Ser 










335 


















Gin 


345 


Gl r\ 

la 111 




Leu 


He Asp 


gi v 


Thr* 
iiu 


Lys Lys 


Pro 


Ser 


Leu Lys 


Pro 










350 










*3 C C 

j jj 










360 




Del 


Pro 


Arg 


Ser 


J-lc 


O (SI 


Ser 


GlU 


Asn 


Ser 


Ser Lys 


Gly 


Ser 






365 










*x n (\ 










375 


pro 


Cot 


Ser 


Pro 


Ala 


«iy 


O /-w» 
Ocl 


Thr 


Pro 


Aia 


He 


Pro 


Lys 


Val 


Arg 










380 








385 










390 


Tin 

lie 


Lys 


Thr 


lie 


Lys 


llll 


day 


Ser 


Gly 


Glu 


He 


Lys 


Arg Thr Val 








395 










400 










An*; 

y u j 


±111 


A >~/"y 

niy 


Val 


Leu 


Pro 


Gl 11 
olU 


val 


Asp 


Leu 


Asp 


Ser Gly Lys 


Lys 


Pro 










410 










415 










Aon 




nl *i 
olU 


Gin 


Thr 


Ala 


ser 


val 


Met 


Ala 


Ser 


Val 


Thr 


Ser 


Leu 












4*SO 










430 










435 


Cor* 


Oar 


Pro 


Ala 


Ser 


AT » 


Ala 
Ala 


Val 


Leu 


Ser 


Ser 


Pro 


Pro 


Arg 


Ala 










A A O 










445 










450 


Pro 


Leu 


Gin 


Ser 


Ala 


Veil 


Vsa 1 
Val 


Thr 


Asn 


Ala 


Val 


Ser 


Pro 


Ala 


GlU 










/ICC 










460 










465 




J- ill 


Pro 


Lys 


bin 


Val 


i. Ill 


He 


Lys 


Pro 


Val 


Ala 


Thr 


Ala 


Phe 








470 








475 










480 


JJCU 


3r±. \J 


Val 
Val 


Ser 


Ala 


Val 




Thr Ala Gly 


Ser 


Gin 


Val 


He 


Asn 










485 








490 










495 


T.Qt I 




Leu 


Ala 


Asn 


S cn 


nil 


Thr 


Val 


Lys 


Ala 


Thr 


Val 


He 


Ser 








500 










505 










510 


AT a 


Al a 

Aid 


Ser 


Val 


Gin 


Ser 


Ala 


Ser 


Ser 


Ala 


He 


He 


Lys 


Ala 


Ala 










515 










520 










525 


/ion 


Ala 


He 


Gin 


Gin 


Gin 

will 


mi 


Val 


Val 


Val 


Pro 


Ala 


Ser 


Ser 


Leu 










530 










535 










540 


Ala 




Ala 


Lys 


Leu 


Val 


Pro 


Lys 


Thr 


Val 


His 


Leu 


Ala 


Asn 


Leu 








545 








550 










555 


Asn 


Leu, 


Leu 


Pro 


Gin 


Glv 


Ala 


Gin 


Ala 


Thr 


Ser 


Glu 


Leu 


Arg 


Gin 










560 








565 










570 


Val 


Leu. 


Thr 


Lys 


Pro 


Gin 


Gin 


Gin 


He 


Lys 


Gin 


Ala 


He 


He 


Asn 








575 










580 










585 


Ala 


Ala 


Ala 


Ser 


Gin 


Pro 


Pro 


Lys 


Lys 


Val 


Ser 


Arg 


Val 


Gin 


Val 










590 










595 










600 


Val 




Ser 


Leu 


Gin 


Ser 


Ser 


Val 


Val 


Glu 


Ala 


Phe 


Asn 


Lys 


Val 










605 










610 










615 


Leu. 


Ser 


Ser 


Val 


Asn 


Pro 


Val 


Pro 


Val 


Tyr 


He 


Pro 


Asn 


Leu 


Ser 










620 










625 










630 


Pro 


Pro 


Ala 


Asn 


Ala 


Glv 


lie 


Thr 


Leu 


Pro 


Thr Arg Gly Tyr 


Lys 










635 








640 










645 


Cys 


Leu 


Glu 


Cys 


Gly 


Asp 


Ser 


Phe 


Ala 


Leu 


Glu 


Lys 


Ser 


Leu 


Thr 








650 










655 










660 


Gin 


His 


Tyr 


Asp 


Arg 


Arg 


Ser 


Val 


Arg 


He 


Glu Val 


Thr 


Cys 


Asn 








665 








670 










675 


His 


Cys 


Thr 


Lys 


Asn 


Leu 


Val 


Phe 


Tyr 


Asn Lys 


Cys 


Ser 


Leu 


Leu 








680 










685 










690 
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ser His Ala Ar^Gly His Lys Glu Lys Gly Val Val Met Gin Cys 
695 700 705 

Ser His Leu lie Leu Ser Gin Ser Gin Gin lie Lys 
710 715 

<210> 254 
<211> ,211 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 1076828 . 1 . orfl : 2000FEB18 
<220> 

<221> unsure 

<222> 25, 54, 93, 113, 116, 121 , 
<223> unknown or other 

<400> 254 

Thr Pro Lys Pro Gin Val lie Ser Leu Leu Glu Gin Gly Lys Glu 
1 ' 5 10 15 

Pro Trp Met Val Gly Arg Glu Leu Thr Xaa Gly Leu Cys Ser Asp 
20 25 30 

Leu Glu Ser Met Cys Glu Thr Lys Leu Leu Ser Leu Lys Lys Glu 
35 40 45 

Val Tyr Glu lie Glu Leu Cys Gin Xaa Glu lie Met Gly Leu Thr 
50 55 60 

Lys His Gly Leu Glu Tyr Ser Ser Phe Gly Asp Val Leu Glu Tyr 
65 70 75 

Arg Ser His Leu Ala Lys Gin Leu Gly Tyr Pro Asn Gly His Phe 
80 85 90 

Ser Gin Xaa lie Phe Thr Pro Glu Tyr Met Pro Thr Phe He Gin 
95 100 105 

Gin Thr Phe Leu Thr Leu His Xaa He He Xaa Asn Glu Asp Arg 
110 115 120 

Xaa Tyr Glu Cys Lys Glu Cys Gly Lys Met Phe Ser His Gly Ser 
125 130 135 

Gin Leu Thr Gin His Gin Arg He His Thr Gly Glu Lys Pro Tyr 
140 145 150 

Gin Cys Lys Glu Cys Gly Lys Ala Phe Asn Arg Gly Ser Leu Leu 
155 160 165 

Thr Arg His Gin Arg He His Thr Gly Glu Lys Pro Tyr Glu Cys 
170 175 180 

Lys Glu Cys Gly Lys Thr Phe Ser Arg Gly Ser Glu Leu Thr Gin 
185 190 195 

His Glu Arg He His Thr Ala Gly Ala Pro Leu Leu Ser Trp Gly 
200 205 , 210 

Leu 



<210> 255 
<211> 103 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 1076931 . 1 .orf2 :2000FEB18 
<400> 255 

Gly Gly Glu Gly Gin Ser Ala Asp Leu Arg Pro Leu Leu Thr Asp 
15 10 15 

Phe Arg Leu Gin Phe He Cys Ala Pro Ala Ser Ser Leu Ser Leu 

20 25 30 

Arg Arg Leu Arg Leu Arg Pro Gin Lys Glu He Ser He Leu Cys 

35 40 45 
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Pro Glu Gin Asn 


Arg 
50 


Met Ala Met Ser 


Gin 
55 


Glu 


Ser 


Leu 


Thr 


Phe 
60 


Lys Asp Val Phe 


Val 


Gly Phe Thr Leu 


Glu 


GlU 


Trp 


Gin 


Gin 


Leu 


65 




' 70 










75 


Asp Pro Ser Gin 


Arg 


Ala Leu Tyr Arg 


Asp 


Val 


Met 


Leu 


Glu 


Asn 




80 


85 










90 


Tyr Ser Asn Leu 


Val 
95 


Ser Val Gly Tyr 


Cys 
100 


Ala 


His 


Lys 







<210> 256 
<211> 84 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: LG : 1078121 . 1 . or f 3 : 2000FEB18 
<400> 256 



Glu Gly 


He 


Pro 


Glu 


Lys 


Lys Glu Glu 


Glu 


Glu 


Glu 


Met 


Ala 


Gly 


1 






5 




10 










15 


Ser Gin Gly Leu 


Leu 


He 


Phe Arg Asp 


Val 


Ala 


He 


Glu 


Phe 


Ser 








20 




25 










30 


Pro Glu 


Glu 


Trp 


Ser 


Tyr 


Leu Asp Pro 


Ala 


Gin 


Gin 


Asn 


Leu 


Tyr 






35 


40 










45 


Arg Asp 


Val 


Met 


Leu 


Glu 


Asn Tyr Arg Asn Leu Val 


Ser 


Leu Gly 






50 






55 










60 


He Ala 


Val 


Ser 


Lys 
65 


Pro 


Glu Leu He 


Thr 
70 


Cys 


Leu 


Glu 


Gin 


Arg 
75 


Asn Glu 


Pro 


Trp 


Asn 


Val 


Lys Lys His 















80 

<210> 257 
<211> 194 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> mi sc_f eature 

<223> Incyte ID No: LG : 1079203 . 1 . orfl : 2000FEB18 
<400> 257 



Asp 


Ala 


Arg 


Thr 


Thr 


Trp 


Lys 


Pro 


Arg 


Asn 


Val 


He Tyr Ser 


His 


1 








5 






10 






15 


Phe 


Thr 


Glu 


Asp 


Leu 


Trp 


Pro 


GlU 


His 


Ser 


He 


Lys Asp Ser 


Phe 








20 








25 






30 


Gin 


Lys 


Val 


He 


Leu 


Arg 


Gly 


Tyr 


Gly 


Lys 


cys 


Gly His Glu Asn 










35 










40 






45 


Leu 


Gin 


Leu 


Arg 


He 


Ser 


Cys 


Lys 


Ser 


Val 


Asp 


Glu Ser Lys 


Val 










50 








55 






60 


Phe 


Lys 


Glu 


Gly 


Tyr 
65 


Asn 


Glu 


Leu 


Asn 


Gin 
70 


cys 


Leu Arg Thr 


Thr 
75 


Gin 


Ser 


Lys 


He 


Phe 
80 


Gin 


Cys 


Asp 


Lys 


Tyr 
85 


Val 


Lys Val Phe 


His 
90 


Lys 


Phe 


Ser 


Asn 


Ser 


Asn 


Ser 


His 


Lys 


Lys 


Arg 


Asn Thr Gly Lys 










95 










100 






105 


Lys 


Val 


Phe 


Lys 


cys 
110 


Lys 


Glu 


Cys 


Gly 


Lys 
115 


Ser 


Phe Cys Met 


Leu 
120 


Ser 


His 


Leu 


Thr 


Gin 
125 


His 


He 


Arg 


He 


His 
130 


Thr 


Arg Glu Asn 


Ser 
135 


Tyr 


Lys 


Cys 


Lys 


Glu 
140 


Cys 


Gly 


Lys 


Val 


Leu 
145 


Asn 


Gin Ser Ser 


Glu 
150 


Leu 


He 


Lys 


His 


Lys 


Lys 


He 


His 


Thr 


Gly 


Glu 


Lys Pro Tyr 


Thr 








155 










160 






165 


Cys 


Glu 


Lys 


Cys 


Gly 
170 


Lys 


Thr 


Phe 


Asn 


Gin 
175 


Ser 


Ala Asn Leu 


Tyr 
180 


Ala 


His 


Lys 


Lys 


He 


His 


Thr 


Gly 


Asp 


Lys 


Thr 


He Gin Val 
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<210> 258 
<211> 129 
<212> PRT 
<213> Homo sapiens 

<220> , • • 

<221> misc_feature 

<223> Incyte ID' No: LG: 1082586 . 1 . orf 1 : 2000FEB18 
<220> 

<221> unsure 
<222> 5 

<223> unknown or other 
<400> 258 



Ala 


Gin 


Pro 


Arg 


Xaa 


Pro Met Gly 


Gin 


Tyr Gin 


Ala Asp Gin 


Tyr 


1 








5 






10 




15 


He 


His 


Arg 


Arg 


Ser 
20 


Ser Ser Arg 


Arg 


Glu Lys 

25' 


Gly Ala Glu 


Arg 
30 


He 


Leu 


Glu 


Glu 


lie 


Met Ala Glu 


Asn 


Phe Ser 


Ser Leu He 


Lys 










35 


He Gin Glu 




40 




45 


Asp 


Met 


Asn 


lie 


A,sn 
50 


Ala 


Gin Gin 
55 


Thr Pro Ser 


Met 
60 


Met 


Asn 


Ser 


Lys 


He 


Ala Thr Leu 


Arg 


His He 


He He Lys 


Leu 








65 




70 


Ala Ala Arg 


75 


Ser 


Lys 


Asp 


Lys 


His 


Arg Pro Leu 


Ser 


Leu Thr 


Ala 








80 






85 




90 


Pro 


Ser 


Leu 


Val 


Phe 
95 


Thr Ser Leu 


Phe 


Leu Leu 
100 


Leu Leu Glu 


Ala 
105 


Gin 


Pro 


Leu 


Trp 


Pro 
110 


Cys Asp Leu 


Gin 


Val Leu 
115 


Gly Asp Pro 


Leu 
120 


Leu 


Arg 


Cys 


Gin 


Asp 
125 


Pro Leu Glu 


Ala 









<210> 259 

<211> 93 1 • 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG : 1082774 . 1 . orf 3 : 2000FEB18 
<400> 259 



Pro 


Ala 


Gly 


He Arg 


Arg Val 


Thr Ala 


Arg 


Thr Pro Gly Pro 


Pro 


1 






5 






10 




15 


Gly 


Ser 


Leu 


Glu Met 


Gly Pro 


Leu Gin 


Phe 


Arg Asp Val Ala 


He 








20 




25 




30 


Glu 


Phe 


Ser 


Leu Glu 


Glu Trp 


His Cys 


Leu 


Asp Ala Ala Gin 


Arg 








35 


40 




45 


Asn 


Leu 


Tyr 


Arg Asp 


Val Met 


Leu Glu 


Asn 


Tyr Arg Asn Leu 


He 








50 






55 




60 


Phe 


Leu 


Gly 


He Val 


Val Ser 


Lys Pro 


Asn 


Leu He Thr Cys 


Leu 






65 




70 


75 


Glu 


Gin 


Gly 


Lys Lys 


Pro Leu 


Thr Met 


Lys 


Arg His Glu Met 


He 








, 80 






85 




90 


Ala 


Lys 


Pro 















<210> 260 
<211> 193 
<212> PRT 
<213> Homo sapiens 

<220> 
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<221> misc__feature 

<223> Incyte ID No: LG : 1082775 . 1 . orf 3 : 2000FEB18 
<400> 260 



Lys His 


Glu lie 


He 


His 


Phe 


Glu 


Glu 


Glu 


Pro 


Ser Glu Tyr Asn 


1 




5 










. 10 




15 


Asn Asn 


Gly Asn 


Ser 


Phe 


Tjtp 


Leu Asn 


GlU 


Asp 


Leu He Trp His 




20 








25 




30 


Gin Lys 


lie Lys 


Asn 


Trr> 


Glu 


Gin 


Pro 


Phe 


GlU 


Tyr Asn Glu Cys 






35 










40 




45 


Glv Lvs 


Ala Phe 


Pro 


Glu 


Asn 


Ser 


Leu 


Phe 


Leu 


Val His Lys Arg 




50 










55 




60 


Ala. I^r 


Thr Gly 


Gin 


Lys 


Thr 


Cys 


Lys 


Tvr 


Thr 


G lu His G ly Lys 






65 










70 




75 


Thr Cys 


Tyr Met 


Ser 


Phe 


Phe 


Tie 


Thf 
j. hi. 


His 


Gin 


Gin Thr His Pro 




80 










85 




90 


Arcs Glu 


Asn His 

noil i v-j 




Glu 


Cys 


Asn 


Glu 


Cys 


Glv 


Glu Ser He Phe 






95 










100 




105 


Glu GlU 


Ser He 


Leu 


Phe 


Glu 


His 


Gin 


Asn 


Val 


Tyr Pro Phe Ser 






110 










115 




120 


Gin Asn 


Leu Asn 


Pro 


Thr 


Leu 


He 


Gin 


Arg 


Thr 


His Ser He Ser 






125 










130 




135 


Asn lie 


lie Glu 


Tyr 


Asn 


Glu 


Cys 


Gly 


Thr 


Phe 


Phe Ser Glu Lys 






140 






145 




150 


Leu Ala 


Leu His 


Leu 


Gin 


Gin 


Arg 


Thr 


His 


Pro 


Gly Glu Lys Pro 






155 










160 




165 


Tyr Glu 


Cys His 


Glu 


Cys 


Gly 


Lys 


Thr 


Phe 


Thr 


Gin Lys Ser Ala 






170 










175 




180 


His Thr 


Arg His 


Gin 


Arg 


Thr 


His 


Thr 


Gly 


Lys 


Thr Leu 




185 










190 







<210> 261 
<211> ill 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: LG : 1083120 . 1 . orf 3 : 2000FEB18 
<400> 261 



Pro 


Gly 


Leu 


Arg 


Asp 


Leu 


Thr 


Cys 


Lys 


GlU 


Leu 


Leu 


He 


Leu 


Thr 


1 






5 










10 










15 


GlU 


Arg 


Glu 


Ala 


Gin 
20 


Lys 


Arg 


Lys 


Lys 


Arg 
25 


Lys 


Glu 


Lys 


<31u 


Ser 
30 


Gly 


Met 


Ala 


Leu 


Thr 


Gin Gly Pro 


Leu 


Thr 


Phe 


Arg Asp 


Val 


Ala 










35 










40 










45 


He 


GlU 


Phe 


Ser 


Gin 
50 


Glu 


Glu 


Trp 


Lys 


Ser 
55 


Leu 


Asp 


Pro 


Val 


Gin 
60 


Lys 


Ala 


Leu 


Tyr 


Trp 


Asp 


Val 


Met 


Leu 


Glu 


Asn 


Tyr 


Arg Asn 


Leu 










65 










70 










75 


Val 


Phe 


Leu 


Gly 


Lys 
80 


Asp 


Asn 


Phe 


Ala 


Leu 
85 


Glu 


Val 


Lys 


He 


Cys 
90 


Pro 


Arg 


Val 


Phe 


Leu 


Tyr 


Phe 


Leu 


Cys 


Cys 


Leu 


Ser 


Val 


Gly Ala 










95 








100 










105 


Arg 


Ser 


He 


His 


Leu 
110 


His 





















<210> 262 
<211> 137 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: LG : 1087707 . 1 . orf 3 : 2000FEB18 
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<400> 262 








Leu 


His 


His 


Ser Pro 


Cys Tyr Pro Val Thr Cys 


Arg Tyr Trp Asp 


1 






5 


10 


15 


lie 


His 


Arg 


Glu Glu- 


Gly Gly Thr Ser Gly Gly 


Trp Glu Met Arg 








20 


25 


30 


Val 


Leu 


Thr 


Phe Arg 


Asp Val Ala Val Glu Phe 


Ser Pro Glu Glu 








35 


40 


45 


Trp 


Glu 


Cys 


Leu Asp 


Ser Ala Gin Gin. Arg Leu 


Tyr Arg Asp Val 








, 50 


55 


60 


Met 


Leu 


GlU 


Asn Tyr 


Gly Asn Leu Phe Ser Leu 


Gly Leu Ala He 








65 


70 


75 


Phe 


Lys 


Pro 


Asp Leu 


lie Thr Tyr Leu Glu Gin 


Arg ( Lys Glu Pro 








80 


85 


90 


Trp 


Asn 


Ala 


Arg Arg 


Gin Lys Thr Val Ala Lys 


His Pro Asp Tyr 








95 


100 


105 


Tyr 


Asp 


Val 


Cys Asn 


Glu Asp Tyr Glu ' Tyr Asn 


Trp Ser Tyr Met 








110 


115 


120 


Phe 


Leu 


Asn 


Ser Glu 


Gin Leu Phe He Lys Phe 


Tyr Pro Thr Phe 








125 


130 


135 


Phe 


Cys 











<210> 263 
<211> 68 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 1090915 . 1 . orf 3 :2000FEB18 
<400> 263 

Met Phe Pro Val Leu Glu Pro His Gin Val Gly Leu He Arg Ser 
1 5 10 15 

Tyr Asn Ser Lys Thr Met Thr Cys Phe Gin Glu Leu Val Thr Phe 

20 25 30 

Arg Asp Val Ala He Asp Phe Ser Arg Gin Glu Trp Glu Cys Leu 
* '35 40 45 

Asp Pro Asn Gin Arg Asp Leu Tyr . Arg Asp Val Met Leu Glu Asn 

50 55 60 

Tyr Arg Asn Leu Val Ser Leu Gly 

65 



<210> 264 
<211> 101 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc__feature 

<223> Incyte ID No: LG:1094230 .1 .orf3 :2000FEB18 
<400> 264 



Asp 


Thr Gly 


Thr Ser 


Trp 


Lys Pro Lys 


Met 


Gly 


Pro 


Leu Gin Phe 


1 




5 






10 






15 


Arg 


Asp Val 


Ala He 


Glu 


Phe Ser Leu 


Glu 


Glu 


Trp His Cys Leu 






20 






25 






30 


Asp 


Thr Ala 


Gin Arg 


Asn 


Leu Tyr Arg 


Asn 


Val 


Met 


Leu Glu Asn 






35 




40 






45 


Tyr 


Ser Asn 


Leu Val 


Phe 


Leu Gly He 


Thr 


Val 


Ser 


Lys Pro Asp 






50 




55 






60 


Leu 


He Thr 


Cys Leu 


Glu 


Gin Gly Arg 


Lys 


Pro 


Leu 


Thr Met Lys 






65 




70 






75 


Arg 


Asn Glu 


Met He 


Ala 


Lys Pro Ser 


Val 


Ser 


Phe 


Leu Gin Val 






80 




85 






90 


His 


Ser Glu 


Ser Gin 
95 


Ser 


Pro Leu His 


Asp 
100 


He 
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<2i0> 265 
<211> 96 
<212> PRT 

<213> Homo sapiens 

<220> ' 

<221> misc_f Mature 

<223> Incyte ID .No : LG:474848.3 *orf 1:2000FEB18 
<400> 265 



Ser 


Ala 


Ala 


Met 


Phe 


Pro Val 


Phe 


Ser 


Gly Cys 


Pfce Gin Glu Leu 


1 








5 








10 


15 


Gin 


Glu 


Lys 


Asn 


Lys 


Ser Leu 


Glu 


Leu 


Val Ser 


Phe Glii Glu Val 








20 








25 


30 


Ala 


Val 


His 


Phe 


Thr 


Trp Glu 


Glu Trp Gin Asp 


Leu Asp Asp Ala 










35 






40 


45 


Gin 


Arg 


Thr 


Leu 


Tyr 


Arg Asp 


Val 


Met 


Leu Glu 


Thr Tyr Ser Ser 










50 








55 


60 


Leu 


Val 


Ser 


Leu 


Gly 


His cys 


He 


Thr 


Lys Pro 


Glu Met He Phe 










65 






70 


75 


Lys 


Leu 


Glu 


Gin 


Gly 


Ala Glu 


Pro 


•Trp 


He Val 


Glu Glu Thr Leu 








80 








85 


90 


Asn 


Leu 


Arg Leu 


Sfcr 


Gly 


















95 













<210> 266 
<211> 251 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 251656 . 1 . orf 2 : 2000FEB01 
<220> 

<221> unsure < ; 
<222> 234 

<223> unknown or other 
<400> 266 



GlU 


Asn 


Gly 


Glu 


Asn 


Cys Asn 


Gin Asp 


Met 


Phe 


Glu 


Asn 


Glu 


Ser 


1 






5 




10 










15 


Arg 


Lys 


He 


Phe 


Ser 


Glu Met 


Pro Glu 


Gly 


Glu 


Ser 


Ala 


Gin 


His 






20 






25 










30 


Ser 


Asp 


Gly 


Glu 


Ser 
35 


Asp Phe 


Glu Arg 


Asp 
40 


Ala 


Gly 


He 


Gin 


Arg 
45 


Leu 


Gin 


Gly 


His 


Thr 


Pro Gly 


Glu Asp 


His 


Gly 


GlU 


Val 


Val 


Ser 








50 




55 










60 


Gin 


Asp 


Arg 


Glu 


Val 


Gly Gin 


Leu He 


Gly 


Leu 


Gin Gly Thr 


Tyr 






65 




70 










75 


Leu 


Gly 


Glu 


Lys 


Pro 


Tyr Glu 


Cys Pro 


Gin 


Cys 


Gly Lys 


Thr 


Phe 










80 






85 










90 


Ser 


Arg 


Lys 


Ser 


His 


Leu He 


Thr His 


Glu 


Arg 


Thr 


His 


Thr 


Gly 






95 






100 










105 


Glu 


Lys 


Tyr 


Tyr 


Lys 
110 


Cys Asp 


Glu Cys 


Gly 
115 


Lys 


ser 


Phe 


Ser 


Asp 
120 


Gly 


Ser 


Asn 


Phe 


Ser 


Arg His 


Gin Thr 


Thr 


His 


Thr Gly Glu 


Lys 










125 




130 










135 


Pro 


Tyr 


Lys 


Cys 


Arg 
140 


Asp Cys 


Gly Lys 


Ser 
145 


Phe 


Ser 


Arg 


Ser 


Ala 
150 


Asn 


Leu 


He 


Thr 


His 


Gin Arg 


He His 


Thr 


Gly 


GlU 


Lys 


Pro 


Phe 










155 




160 










165 


Gin 


Cys 


Ala 


GlU 


Cys 


Gly Lys 


Ser Phe 


Ser 


Arg 


Ser 


Pro 


Asn 


Leu 










170 




175 










180 


He 


Ala 


His 


Gin 


Arg 
185 


Thr His 


Thr Gly 


<31u 
190 


Lys 


Pro 


Tyr 


Ser 


Cys 
195 


Pro 


Glu 


Cys 


Gly 


Lys 


Ser Phe 


Gly Asn 


Arg 


Ser 


Ser 


Leu 


Asn 


Thr 
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ZOO 205 210 

His Gin Gly lie His Thr Gly Glu Lys Pro Tyr Glu Cys Lys Glu 

215 220 225 , 

Cys Gly Glu Ser Phe Ser Tyr Asn Xaa Asn Leu lie Arg Gin Gin ' 

230 , 235 240 

Arg lie His Thr Gly Glu Lys Pro Tyr Lys Cys 

245 250 

<210> 267 
<211> 522 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 021371 . 1 .orf 3 : 2000FEB01 
<400> 267 

Pro Gly Met Ser Val Ala Gly Val Glu Gly Glu Pro Leu Val Ser 

1 5 10 15 

Ser Gin Ser Gly Gin Ser Pro Pro Glu Pro Gin Asp Pro Glu Ala 

20 25 30 

Pro Ser Ser Ser Gly Pro Gly His Leu Val Ala Met Gly Lys Val 

35 40 45 

Ser Arg Thr Pro Val Glu Ala Gly Val Ser Gin Ser Asp Ala Glu 

50 55 60 

Asn Ala Ala Pro Ser Cys Pro Asp Glu His Asp Thr Leu Pro Arg 

65 70 75 

Arg Arg Gly Arg Pro Ser Arg Arg Phe Leu Gly Lys Lys Tyr Arg 

80 85 90 

Lys Tyr Tyr Tyr Lys Ser Pro Lys Pro Leu Leu Arg Pro Phe Leu 

95 100 105 

Cys Arg lie Cys Gly Ser Arg Phe Leu Ser His. Glu Asp Leu Arg 

110 115 120 

Phe His Val Asn Ser His Glu Ala Gly Asp Pro Gin Leu Phe Lys 

125 130 135 

Cys Leu Gin Cys Ser Tyr Arg Ser Arg Arg Trp Ser Ser Leu Lys 

140 145 150 

Glu His Met Phe Asn His Val Gly Ser Lys Pro Tyr Lys Cys Asp 

155 160 165 

Glu Cys Ser Tyr Thr Ser Val Tyr Arg Lys Asp Val lie Arg His 

170 175 180 

Ala Ala Val His Ser Arg Asp Arg Lys Lys Arg Pro Asp Pro Thr 

185 190 195 

Pro Lys Leu Ser Ser Phe Pro Cys Pro Val Cys Gly Arg Val Tyr 

200 205 210 

Pro Met Gin Lys Arg Leu Thr Gin His Met Lys Thr His Ser Thr 

215 220 , 225 

Glu Lys Pro His Met Cys Asp Lys Cys Gly Lys Ser Phe Lys Lys 

230 235 240 

Arg Tyr Thr Phe Lys Met His Leu Leu Thr His He Gin Ala Val 

245 250 255 

Ala Asn Arg Arg Phe Lys Cys Glu Phe Cys Glu Phe Val Cys Glu 

260 265 270 

Asp Lys Lys Ala Leu Leu Asn His Gin Leu Ser His Val Ser Asp 

275 280 285 

Lys Pro Phe Lys Cys Ser Phe Cys Pro Tyr Arg Thr Phe Arg Glu 

290 295 300 

Asp Phe Leu Leu Ser His Val Ala Val Lys His Thr Gly Ala Lys 

305 310 315 
Pro Phe Ala Cys Glu Tyr Cys His Phe Ser Thr Arg His Lys Lys 

320 325 330 

Asn Leu Arg Leu His Val Arg Cys Arg His Ala Ser Ser Phe Glu 

335 340 345 
Glu Trp Gly Arg Arg His Pro Glu Glu Pro Pro Ser Arg Arg Arg 

350 355 360 
Pro Phe Phe Ser Leu Gin Gin He Glu Glu Leu Lys Gin Gin His 
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BNSDOCID: <WO 0162927A2_I_> 



WO 01/62927 



Ser Ala Ala Pro Gly 
380 

lie Pro Pro Glu Ala 
395 

Leu Leu Cys Ser Asp 
410 

Gin Gly Ala Glu Glu 
425 

Asp Leu Leu Leu Asn 
440 

Ala Leu Gin Val Cys 
455 

Pro Gin Tyr Pro Ala 
470 

Ser Val Ser Glu Leu 
485 

Gly Asp Arg Ser Ala 
500 

Ala Leu Pro Arg Arg 
515 

<210> 268 
<211> 267 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_f eature 
<223> Incyte ID No: 

<220> 

<221> unsure 
<222> 36 
<223> unknown or other 

<400> 268 



Gly 


Leu Gly 


Glu 


Glu 


Val 


Pro 


Cys 


Ala 


Met 


Met 


Glu Gly Val 


Ala 


1 








5 










10 










15 


Ala 


Tyr 


Thr 


Gin 


Thr 


Glu 


Pro 


Glu 


Gly 


Ser 


Gin 


Pro 


Ser 


Thr 


Met 








20 










25 










30 


Asp 


Ala 


Thr 


Ala 


Val 


Xaa 


Gly 


He 


Glu 


Thr 


Lys 


Lys 


Glu 


Lys 


GlU 








35 










40 










45 


Asp 


Leu 


Cys 


Leu 


Leu 


Lys 


Lys 


Glu 


Glu 


Lys 


Glu 


Glu 


Pro 


Val 


Ala 






50 






55 










60 


Pro 


Glu 


Leu 


Ala 


Thr 
65 


Thr 


Val 


Pro 


Glu 


Ser 
70 


Ala 


Glu 

i 


Pro 


Glu 


Ala 
75 


Glu 


Ala 


Asp 


Gly Glu 


Glu 


Leu 


Asp 


Gly 


Ser 


Asp 


Met 


Ser 


Ala 


He 










80 










85 










90 


lie 


Tyr 


Glu 


lie 


Pro 


Lys 


Glu 


Pro 


GlU 


Lys 


Arg 


Arg 


Arg 


Ser 


Lys 










95 








100 










105 


Arg 


Ser 


Arg 


Val 


Met 


Asp 


Ala 


Asp 


Gly 


Leu 


Leu 


Glu 


Met 


Phe 


His 










110 










115 




Ala 






120 


Cys 


Pro 


Tyr 


Glu 


Gly 


Cys 


Ser 


Gin 


Val 


Tyr 


Val 


Leu 


Ser 


Ser 








125 








130 










135 


Phe 


Gin 


Asn 


His 


Val 
140 


Asn 


Leu 


val 


His 


Arg 
145 


Lys 


Gly 


Lys 


Thr 


Lys 
150 


Val 


Cys 


Pro 


His 


Pro 


Gly 


Cys 


Gly Lys 


Lys 


Phe 


Tyr 


Leu 


Ser 


Asn 








155 








160 










165 


His 


Leu Arg 


Arg 


His 


Met 


He 


He 


His 


Ser 


Gly 


Val 


Arg Glu 


Phe 










17 0 










175 










180 


Thr 


Cys 


Glu 


Thr 


Cys 


Gly 


Lys 


Ser 


Phe 


Lys 


Arg 


Lys 


Asn 


His 


Leu 








185 








190 










195 


Glu 


Val 


His 


Arg 


Arg 


Thr 


His 


Thr 


Gly 


GlU 


Thr 


Pro 


Leu <31n 


Cys 










200 










205 










210 


Glu 


lie 


Cys 


Gly Tyr 


Gin 


Cys 


Arg Gin 


Arg 


Ala 


Ser 


Leu 


Asn 


Trp 



215 220 225 

139/228 
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370 










375 


Pro 


Pro Pro 


Ser 


Ser 


Pro 


Gly Pro 


Pro 


GlU ' 








385 










390 


Thr 


Thr Phe 


Gin 


Ser 


Ser 


Glu 


Ala 


Pro 


Ser 








400 










405 


Thr Leu Gly Gly 


Ala 


Thr 


He 


He Tyr Gin 








415 










420 


Ser 


Thr Ala 


Met 


Ala 


Thr 


Gin 


Thr 


Ala 


Leu 








430 










435 


Met 


Ser Ala 


Gin 


Arg 


Gly 


Pro Gly Gly Thr 








445 










450 


Cys 


Leu Gly Thr 


Cys 


Ser 


Pro 


Ser 


Gin 


Leu 






460 










465 


Leu 


His Trp 


Thr 


Leu 


Gly 


Leu 


Glu 


Glu 


Asn 






475 










480 


Leu Arg Pro 


Trp 


Gly 


Leu 


Pro 


Gly 


Ser 


Gly 








490 










495 


Glu 


Val Trp 


Trp 


Ala 


Asn 


Arg 


Glu 


Glu 


Gin 






505 










510 


Pro 


Gin Gly 


He 


Pro 


Ser 


He 












520 













LI:133 095.1.orf2 :2000FEB01 



WO 01/62927 



PCT/US01/06059 



His Met Lys Lys His Thr Ala Glu Val Gin Tyr Asn Phe Thr Cys 
230 , 235 240 

Asp Arg Cys Gly Lys Arg Phe Glu Lys Leu Asp Ser Val Lys Phe 
245 - 250 255 

His Thr Leu Lys Ser His Pro Asp His Lys Pro Thr 
260 265 

<210> 269 
<211> 286 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LI : 23 6654 .2 . orf 2 :2000FEB01 
<400> 269 



Arg 


Pro 


Leu 


Pro 


Ala 


Asp 


Leu 


Pro 


Val 


Gly 


Gly 


His His 


Cys Leu 


1 








5 








10 






15 


His 


Gly 


Pro 


Gin Glu 


Ala 


Gly 


Leu 


Ser 


Ala 


Leii 


Gin Arg 


Pro Gin 










20 








25' 






30 


Pro 


Arg 


Pro 


Gly Leu 


Arg 


Thr 


Arg 


Gly 


Ala 


Glu 


Gly Leu 


Glu Leu 










35 








40 






45 


Pro 


Ala 


Leu 


Trp 


Gin 


Thr 


Val 


His 


Ser 


Gly 


Leu 


Glu Ala 


Ala Ala 








50 










55 






60 


Ser 


Arg 


Pro 


Val Gly 


Pro 


Arg 


Thr 


Val 


His 


Leu 


Pro Asp 


Arg He 










65 










70 






75 


Arg 


Gly 


Pro 


Gly Gly 


Pro 


Val 


Leu 


Gly 


Leu 


Ala 


Glu Val 


Ala Ala 










80 










85 






90 


Ala 


Val 


Ser 


Ala 


Val 


Val 


Gly 


Pro 


Ala 


Ala 


Glu 


Ala Lys 


Ser Pro 










95 








100 






105 


Arg 


Ala 


Ser 


Gly Ser 


Gly 


Leu 


Thr 


Arg 


Arg 


Ser 


Pro Pro 


Val Leu 










110 










115 






120 


Cys 


Ala 


Arg 


Arg 


Pro 


Ser 


Ala 


Pro 


Ser 


Ala 


Thr 


Ser Lys 


Cys Thr 










125 










130 






135 


Cys 


Ala 


His 


Thr 


Gin 


Ala 


Ser 


Gly 


Pro 


Met 


Leu 


Ala Thr 


Ser Val 










140 








145 






150 


Pro 


Thr 


Pro 


Ala 


Pro 


Arg 


Ala 


Ala 


Ser 


Ser 


Thr 


Ala Thr 


Arg Arg 










155 










160 






165 


Pro 


Thr 


Gly 


Arg 


Cys 


Arg 


Pro 


Arg 


Ala 


Pro 


Ser 


Trp Pro 


Thr Pro 










170 










175 






180 


Ala 


Arg 


Ser 


Arg 


Pro 


Leu 


Gin 


Pro 


Leu 


Arg 


Ser 


Arg Leu 


Ser Met 










185 










190 






195 


Leu 


Leu 


Pro 


Pro 


Pro 


Ala 


Pro 


Phe 


His 


Ala 


Ala 


Val Val 


Arg Gly 










200 










205 






210 


Leu 


Glu 


Pro 


Pro 


Pro 


Gin 


Gin 


Val 


Ser 


Arg 


Asn 


Pro Gly 


Leu Leu 










215 










220 






225 


Ala 


Val 


Gly 


Leu 


Lys 


Pro 


Ala 


Leu 


Val 


Glu 


Thr 


Leu Gly 


Glu Pro 










230 










235 






240 


Ser 


Pro 


Arg 


Asn 


Lys 


Glu 


Leu 


Thr 


Leu 


Gin 


Thr 


Ala Axg 


Arg His 










245 










250 






255 


His 


Pro 


Lys 


Arg 


Cys 


Pro 


Ser 


Gin 


Gly 


Ala 


Arg 


Ala Ala 


Gly Pro 










260 










265 






270 


Gly 


Ala 


Ala 


Val 


Ser 


Ser 


Ala 


Gly 


Ser 


He 


Leu 


Pro Thr 


Ala Ala 



275 280 285 

Thr 



<210> 270 
<211> 194 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> mis cofeature 

<223> Incyte ID No: LI : 200009 . 1 .orf 3 :2000FBB01 
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<400> 270 




















Gly 


Leu 


Ser Pro 


Lvs, 


Ala 


Ala 


Asn 


Leu 


Ala 


Pro 


Thr Thr Gin 


Gin 


1 






5 










10 






15 


Arg 


Ser 


Val Val 


Phe 


Pro 


Gin 


Thr 


Pro 


Cys 


Ser 


Arg Asn Phe 


Ser 




Asp Lys 


20 










25 






30 


Leu 


Leu 


Ser 


Gly Pro 


He Glu Ser Gly 


Phe Asn Gin 


He 






35 










40 






45 


Asn 


Val 


Lys Asn 


Gin 


Arg Val 


Leu 


Ala 


Ser 


Pro 


Thr Ser Thr 


Ser 






50 










55 






60 


Gin 


Leu 


His Ser 


Glu 


Phe 


Ser 


Asp 


Trp 


His 


Leu 


Trp Lys Cys Gly 








65 










70 






75 


Gin 


Cvs 


Phe Lys 


Thr 


Phe 


Thr Gin Arg 


He 


Leu 


Leu Gin Met 


His 




80 










85 






90 


Val 


Cys 


Thr Gin 


Asn 


Pro 


Asp 


Arg 


Pro Tyr 


Gin 


Cys Gly His 


Cys 






95 










100 






105 


Ser 


Gin 


Ser Phe 


Ser 


Gin 


Pro 


Ser 


Glu 


Leu 


Arg 


Asn His Val 


Val 








110 










115 




120 


Thr 


His 


Ser Ser 


Asp 


Arg 


Pro 


Phe 


Lys 


Cys 


Gly 


Tyr Cys Gly Arg 








125 










130 






135 


Ala 


Phe 


Ala Gly 


Ala 


Thr 


Thr 


Leu 


Asn 


Asn 


His 


He Arg Thr 


His 






140 










145 






150 


Thr 


Gly 


Glu Lys 


Pro 


Phe 


Lys 


Cys 


Glu 


Arg 


Cys 


Glu Arg Ser 


Phe 




155 










160 






165 


Thr 


Gin 


Ala Thr 


Gin 


Leu 


Ser 


Arg 


His 


Gin Arg 


Met Pro Asn 


Glu 








170 










175 






180 


Cys 


Lys 


Pro lie 


Thr 


Glu 


Ser 


Pro 


Glu 


Ser 


He 


Glu Val Asp 






185 










190 









<210> 271 
<211> 263 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: LI : 758502 . 1 . orf 3 :2000FEB01 
<400> 271 



Thr 


Leu 


He 


Lys 


His 


Gin 


Arg 


Thr 


His 


Thr 


Gly 


Glu Arg Pro Tyr 


1 






5 










10 






15 


GlU 


Cys 


Pro 


Glu 


Cys 
20 


Gly 


Lys 


Thr 


Phe 


Gly 
25 


Arg 


Lys 


Pro His Leu 
30 


He 


Met 


His 


Gin 


Arg 


Thr 


His 


Thr Gly 


GlU 


Lys 


Pro Tyr Ala Cys 










35 










40 






45 


Leu 


Glu 


Cys 


His 


Lys 


Ser 


Phe 


Ser 


Arg 


Ser 


Ser 


Asn 


Phe He Thr 








50 










55 






60 


His 


Gin 


Arg 


Thr 


His 


Thr 


Gly 


Val 


Lys 


Pro 


Tyr 


Arg 


Cys Asn Asp 








65 










70 






75 


Cys 


Gly 


Glu 


Ser 


Phe 


Ser 


Gin 


Ser 


Ser 


Asp 


Leu 


He 


Lys His Gin 






80 










85 






90 


Arg 


Thr 


His 


Thr 


Gly 
95 


Glu 


Arg 


Pro 


Phe 


Lys 
100 


Cys 


Pro 


Glu Cys Gly 
105 


Lys 


Gly 


Phe 


Arg 


Asp 


Ser 


Ser 


His 


Phe 


Val 


Ala 


His 


Met Ser Thr 




110 










115 






120 


His 


Ser 


Gly 


Glu 


Arg 


Pro 


Phe 


Ser 


Cys 


Pro 


Asp 


Cys 


His Lys Ser 








125 










130 






135 


Phe 


Ser 


Gin 


Ser 


ser 
140 


His 


Leu 


Val 


Thr 


His 
145 


Gin 


Arg 


Thr His Thr 
150 


Gly 


Glu 


Arg 


Pro 


Phe 
155 


Lys 


Cys 


Glu 


Asn 


Cys 
160 


Gly 


Lys 


Gly Phe Ala 
165 


Asp 


Ser 


Ser 


Ala 


Leu 
170 


Thr 


Lys 


His 


Gin 


Arg 
175 


He 


His 


Thr Gly Glu 
180 


Arg 


Pro 


Tyr 


Lys 


Cys 
185 


Gly 


Glu 


Cys 


Gly 


Lys 
190 


Ser 


Phe 


Asn Gin Ser 
195 


Ser 


His 


Phe 


He 


Thr 
200 


His 


Gin 


Arg 


He 


His 
205 


Leu 


Gly 


Asp Arg Pro 
210 


Tyr 


Arg 


Cys 


Pro 


Glu 


Cys 


Gly 


Lys 


Thr 


Phe 


Asn 


Gin 


Arg Ser His 
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L5 220 225 

Phe Leu Thr His Gin Arg Thr His Thr Gly Glu Lys Pro Phe His 
230 235 240, 

Cys Ser Lys Cys Asn Lys Ser Phe Arg Gin Lys Ala His Leu Leu' 
245 , 250 255 

Cys His Gin Asp Thr His Leu lie 
260 

<210> 272 
<211> 142 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI :344772 . 1 .orf 2 :2000FEB01 
<220> 

<221> unsure 
<222> 142 

<223> unknown or other 



<400> 272 



Ala 


Lys 


Asn 


Leu 


Phe 


Lys Met 


1 








5 




Ser 


Tyr 


Val 


Ser 


Gly 


Ser Gly 










20 




Leu 


Gly 


Ala 


Pro 


Val 


Gly Pro 










35 




His 


Ser 


Ser 


Pro 


Ser 


Arg Ser 










50 




Glu 


Met 


Tyr 


Ser 


Asp 


Glu Glu 










65 




GlU 


Arg 


Leu 


Leu 


Glu 


Lys Asp 










80 




Leu 


Ser 


Glu 


Pro 


Leu 


Gly Tyr 










95 




His 


Ser 


Pro Gly 


Gly 


lie Arg 










110 




Asp 


Val 


Cys 


Gly 


Met 


Val Cys 










125 




His 


Lys 


Arg Ser 


His 


Thr Xaa 



140 



Asp 


lie Glu Asp Cys Asn 


Gly Arg 




10 




15 


Asp 


Ser Ser Leu Glu Lys 


Glu 


Phe 


25 




30 


Ser 


Val Ser Thr Pro Asn 


Ser 


Gin 




40 




45 


Leu 


Ser Ala Asn Ser lie 


Lys 


Val 




55 


60 


Ser 


Ser Arg Leu Leu Gly 


Pro 


Asp 




70 




75 


Asp 


Ser Val lie Val Glu 


Asp 


Ser 




85 




90 


Cys Asp Gly Ser Gly Pro 


Glu 


Pro 




100 




105 


Leu 


Pro Asn Gly Lys Leu 


Lys 


Cys 




115 




120 


lie Gly Pro Asn Val Leu 


Met 


Val 




130 




135 



<210> 273 
<211> 164 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI :789445 . 1 .orf 2 :2000FEB01 
<400> 273 



GlU 


His 


Arg 


Glu 


Ala 


Lys 


Ala 


Ser 


Gly 


Trp 


Val 


Thr Asp 


Gly Leu 


1 








5 










10 






15 


Leu 


Met 


Asp 


Ser 


Ser 


Gin 


His 


Leu 


Val 


Thr 


Phe 


Glu Asp 


Val Ala 










20 










25 






30 


Val 


Asp 


Phe 


Thr 


Gin 


Glu 


Glu 


Trp 


Thr 


Leu 


Leu 


Asp Gin 


Ala Gin 










35 








40 






45 


Arg 


Asp 


Leu 


Tyr 


Arg 


Asp 


Val 


Met 


Leu 


Glu 


Asn 


Tyr Lys 


Asn Leu 










50 










55 






60 


He 


He 


Leu 


Ala 


Gly 


Ser 


Glu 


Leu 


Phe 


Lys 


Arg 


Ser Leu 


Met Ser 










65 










70 






75 


Gly 


Leu 


Glu 


Gin 


Met 


GlU 


Glu 


Leu 


Arg 


Thr 


Gly 


Val Thr 


Gly Val 










80 










85 






90 


Leu 


Gin 


Glu 


Leu 


Asp 


Leu 


Gin 


Leu 


Lys 


Thr 


Lys 


Gly Ser 


Pro Leu 
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95 1 100 105 

Leu Gin Asp He Ser, Ala Glu Arg Ser Pro Asn Gly Val Gin Leu 

110 115 120 

Glu Arg Ser Asn Thr Ala Glu Lys Leu Tyr Asp Ser Asn His ser 

125 13 0 135 

Gly Lys Val Phe Asn Glu His Pro Phe Leu Met Thr His Met lie 

140 145 150 

Thr His He Gly , Glu Lys Thr Ser Glu Asp Asn Gin Ser Gly 

155 160 

<210> 274 
<211> 107 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> incyte ID No: LI : 789657 . 1 . orf 2 : 2000FEB01 
<220> 

<221> unsure 

<222> 5 * 
<223> unknown or other 

<400> 274 



Met 


Trp 


Gin 


Val 


Xaa 


Ser 


Lys 


Ser 


Ser 


His 


Leu 


Ala Val His 


Gin 


1 








5 








10 






15 


Arg 


He 


His 


Thr 


Gly 


Glu Lys 


Pro 


tyr 


Lys 


Cys 


Asn Arg Cys Gly 










20 










25 






30 


Lys 


Cys 


Phe 


Ser 


Gin 


Ser 


Ser 


Ser 


Leu 


Ala 


Arg 


His Gin Thr 


Val 






35 










40 






45 


His 


Thr 


Gly Glu 


Lys 


Pro 


Tyr 


He 


Cys 


Ala 


Glu 


Cys Gly Lys 


Ala 










50 










55 






60 


Phe 


Ser 


Gin 


Lys 


Ser 


Asp 


Leu 


Val 


Val 


His 


Gin 


He He His 


Thr 








65 








70 






75 


Gly 


Glu 


Lys 


Pro 


Asp 
80 


Arg 


Cys 


Thr 


Val 


Cys 
85 


Gly 


Lys Ala Phe 


He 
90 


Gin 


Lys 


Ser 


Gin 


Leu 


Thr 


Val 


His 


Gin 


Arg 


He 


His Thr Leu 


Met 








95 










100 






105 


Lys 


Ser 

























<210> 275 
<211> 105 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_ feature 

<223> Incyte ID No: LI :789808 . 1 .orf 3 : 2000FEB01 
<400> 275 



Glu 


His 


Thr 


Asp 


Gly 


Lys 


Ser 


Tyr 


Ala 


Cys 


He Gin Cys Gly Lys 


1 








5 










10 


15 


Phe 


Phe 


Cys 


Cys 


Tyr 


Tyr 


Ser 


Phe 


Thr 


Glu 


His Leu Arg Arg His 










20 










25 


30 


Thr 


Gly 


Glu Lys Pro 


Phe 


Gly 


Cys 


Asn 


Glu 


Cys Gly Lys Thr Phe 










35 










40 


45 


His 


Gin 


Lys 


Leu 


Ala 


Leu 


He 


Val 


His 


Gin 


Arg Thr His He Arg 








50 










55 


60 


Gin 


Lys 


Pro 


Tyr 


Gly 


Cys 


Asn 


Glu 


Cys 


Gly 


Lys Ser Phe Cys Val 










65 










70 


75 


Lys 


Ser 


Lys 


Leu 


He 


Ala 


His 


His 


Arg 


Thr 


Tyr Thr Gly Glu Lys 










80 










85 


90 


Pro 


Tyr 


Glu 


Cys 


Asn 


Val 


Cys 


Gly Lys 


Leu 


Leu Leu Ser -Gin Asn 








95 








100 


105 
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<210> 276 
<211> 149 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI.-792919 .1 .orf 1 .-2000FEB01 
<400> 276 

His Gin Met He His Met Gly Gin Asn Pro Tyr Asn Cys Lys Glu 
15 10 15 

Cys Gly Lys Ser Phe Lys Trp Ser Ser Tyr Leu Leu Val His Gin 

20 * 25 30 

Arg Val His Thr Gly Glu Lys Pro Tyr Lys Cys Glu Glu Cys Gly 

35 ~ 40 45 

Lys Gly Tyr He Ser Lys Ser Gly Leu Asp Phe His His Arg Thr 

50 55 60 

His Thr Gly Glu Arg Ser Tyr Asn Cys Asp Asn Cys Gly Lys Ser 

65 ' 70 75 

Phe Arg His Ala Ser Ser He Leu Asn His Lys Lys Leu His Cys 

80 85 90 

Gin Arg Lys Pro Leu Lys Cys Glu Asp Cys Gly Lys Arg Leu Val 

95 "* 100 105 

Cys Arg Ser Tyr Cys Lys Asp Gin Gin Arg Asp His Ser Gly Glu 
110 115 120 

Asn Pro Ser Lys Cys Glu Asp Cys Gly Lys Arg Tyr Lys Arg Arg 
125 130 135 

Leu Asn Leu Asp He He Leu Ser Leu Phe Leu Asn Asp He 
140 145 

<210> 277 
<211> 101 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI :793949 . 1 .orf 3 : 2000FEB01 



<400> 277 



Asp 


Thr 


Gly Thr 


Ser 


Trp Lys 


1 








5 






Arg 


Asp 


Val 


Ala 


He 
20 


Asp 


Phe 


Asp 


Thr 


Ala 


Gin 


Arg 
35 


Asp 


Leu 


Tyr 


Ser 


Asn 


Leu 


Val 
50 


Phe 


Leu 


Val 


He 


Ser 


Ser 


Leu 
65 


GlU 


Gin 


Arg 


Asn 


Glu 


Met 


He 
80 


Ala 


Lys 


His 


Ser 


Glu 


Ser 


Gin 
95 


Ser 


Pro 



<210> 278 
<211> 137 
<212> PRT 
<213> Homo sapiens 



Pro Lys Met 


Gly 


Pro 


Leu 


Gin 


Phe 


10 








15 


Ser Gin Glu 


Glu 


Trp 


His 


Cys 


Leu 


25 










30 


Tyr Arg Cys 


Val 


Met 


Leu 


Glu 


Asn 


40 










45 


Gly He Thr 


Val 


Ser 


Lys 


Pro 


Asp 


55 










60 


Gly Arg Lys 


Pro 


Leu 


Thr 


Met 


Lys 


70 










75 


Pro Ser Val 


Ser 


Phe 


Leu 


Gin 


Val 


85 










90 


Leu His Asp 


He 











100 



<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 7943 89 . 1 .orf 3 : 2000FEB01 
<220> 

<221> unsure 
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<222> 23 

<223> unknown or other 
<400> 278 



Gly Leu 


Gin Lys 


Thr 


Phe Cys 


Arg 


Val 


Met 


Gin Phe Thr Leu His 


1 


• • 


5 




10 


15 


Arg> 


Arg 


lie His 


Thr 


Gly Glu 


Xaa 


Pro 


Tyr 


Glu Cys Lys Glu Cys 




20 






25 


30 


Gly 


Lys 


Ser Phe 


Ser 


Ala His 


Ser 


Ser 


Leu 


Val Thr His Lys Arg 




35 








40 


45 


Thr 


His 


Ser Gly 


GlU 


Lys Pro 


Tyr 


Lys 


Cys 


Lys Glu Cys Gly Lys 








50 








55 


60 


Ala 


Phe 


Ser Ala 


His 


Ser Ser 


Leu 


Val 


Thr 


His Lys Arg Tnr His 








65 








70 


75 


Ser 


Gly 


Glu Lys 


Pro 


Tyr Thr 


Cys 


His 


Ala 


Cys Gly Lys Ala Phe 




80 






85 


90 


Asn 


Thr 


Ser Ser 


Thr 


Leu Cys 


Gin 


His 


Asn 


Arg He His Thr Gly 








95 






100 


105 


Glu 


Lys 


Pro Phe 


Gin 


Cys Ser 


Gin 


Cys 


Gly 


Lys Ser Phe Ser Cys 






110 






115 


120 


Ser 


Ser 


His Leu 


Thr 


Arg His 


Cys 


Arg Met 


Cys Asn Gly Lys Phe 








125 






13 0 


135 


Ser 


Lys 

















<210> 279 

<211> 97 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc__f eature 

<223> Incyte ID No: LI :79 6010 . 1 . orf 3 : 2000FEB01 
<220> 

<221> unsure 

<222> 4, 18, 23 

<223> unknown or other 

<400> 279 



Leu 


Cys 


He 


Xaa 


Lys 


His 


Thr 


Gly 


Glu 


Lys 


Pro 


Tyr 


GlU 


Cys 


Tyr 


1 






5 










10 










15 


Ala 


Cys 


Xaa 


Asn 


Thr 


Phe 


Leu 


Xaa 


Lys 


Ser 


Asp 


Leu 


He 


Lys 


His 








20 










25 










30 


Gin 


Arg 


He 


His 


Thr 


Gly 


Glu 


Lys 


Pro 


Tyr 


GlU 


Cys 


Asn 


Glu 


Cys 








35 










40 










45 


Gly 


Lys 


Ser 


Phe 


Ser 


Glu 


Lys 


Ser 


Thr 


Leu 


Thr 


Lys 


His 


Leu 


Arg 






50 








55 










60 


Thr 


His 


Arg 


Trp 


Glu 


He 


Leu 


Cys 


Met 


Tyr 


Ser 


Met 


Trp 


Lys 


He 








65 








70 










75 


Phe 


Leu 


Leu 


Leu 


Leu 


Gin 


Phe 


His 


Arg 


Thr 


Ser 


Glu Lys 


Thr 


His 










80 










85 










90 


Arg 


Gly 


Glu 


Thr 


Phe 
95 


Trp 


Met 



















<210> 280 

<211> 97 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: LI : 796324 . 1 . orf 2 : 2000FEB01 
<220> 

<221> unsure 
<222> 93 
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<223> unknown or other 
<400> 280 

Leu cys lie Arg Lys' His Thr Gly Glu Lys Pro Tyr Glu Cys Tyr 
15 10 15 

Ala Cys Gly Asn Thr Phe Leu Arg Lys Ser Asp Leu lie Lys His 

20 .25 30 

Gin Arg lie His Thr Gly Glu Lys Pro- Tyr Glu Cys Asn Glu Cys 
• 35 40 45 

Gly Lys Ser Phe ' Ser Glu Lys Ser Thr Leu Thr Lys His Leu Arg 

50 55 60 

Thr His Arg Trp Glu lie Leu Cys Met Tyr Ser Met, Trp Lys He 

65 70 75 

Phe Leu Leu Leu Leu Gin Phe His Arg Thr Ser Glu Lys Thr His 

80 ,85 90 

Arg Gly Xaa Thr Phe Trp Met 

<210> 281 
<211> 179 
<212> PRT 
<213> Homo sapiens 

<220> , 
<221> misc_feature 

<223> Incyte ID No: LI:796373 .l.orf 1:2000FEB01 
<400> 281 

Met Trp Glu Ser Phe Ser Gin Lys Thr Cys Leu He Ser His Gin 

15 10 15 

Arg Phe His Thr Gly Lys Thr Pro Phe Val Cys Thr Glu Cys Gly 

20 25 30 

Lys Ser Cys Ser His Lys Ser Gly Leu He Asn His Gin Arg He 

35 40 45 

His Thr Gly Glu Lys Pro Tyr Thr Cys Ser Asp Cys Gly Lys Ala 

50 55 60 

Phe Arg Asp Lys Ser Cys Leu Asn Arg His Arg Arg Thr His Thr 

65 70 75 

Gly Glu Arg Pro Tyr Gly Cys Ser Asp Cys Gly Lys Ala Phe Ser 

80 85 90 

His Leu Ser Cys Leu Val Tyr His Lys Gly Met Leu His Ala Arg 

95 100 105 

Glu Lys Cys Val Gly Ser Val Lys Leu Glu Asn Pro Cys Ser Glu 

110 115 120 

Ser His Ser Leu. Ser His Thr Arg Asp Leu He Gin Asp Lys Asp 

125 130 135 

Ser Val Asn Met Val Thr Leu Gin Met Pro Ser Val Ala Ala Gin 

140 145 150 

Thr Ser Leu Thr Asn Ser Ala Phe Gin Ala Glu Ser Lys Val Ala 

155 160 165 

He Val Ser Gin Pro Val Ala Arg Ser Ser Val Ser Ala Asp 

170 175 

<210> 282 
<211> 87 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 79 6415 . 1 .orf 3 :2000FEB01 
<400> 282 

Lys His Glu He He His Phe Glu Glu Glu Pro Ser Glu Tyr Asn 

15 10 15 

Asn Asn Gly Asn Ser Phe Trp Leu Asn Glu Asp Leu He Trp His 

20 25 30 
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Gin Lys He Lys Asn Trp Glu Gin Pro Phe Glu Tyr Asn Glu Cys 

35, 40 45 

Gly Lys Ala Phe Pro Glu Asn Ser Leu Phe Leu Val His Lys Arg 

50 55 60 

Ala Tyr Thr Gly Gin Lys Thr cys Lys Tyr Thr Glu His Gly Lys 

,65 70 75 

Thr Cys Tyr Met Ser Phe Phe He Thr His Gin Gin 

, 80 85 

<210> 283 
<211> 172 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 798636 . 1 . orf 2 : 2 000FEB01 
<400> 283 



Asn 


Glu 


Cys Gly 


Lys 


Ala 


Leu 


Ser 


Ser 


His 


Ser Thr Leu He 


He 


1 




5 










10 




15 


His 


Glu 


Arg He 


His 


Thr 


Gly 


Glu 


Lys 


Pro 


Cys Lys Cys Lys 


Val 








20 








25 




30 


Cys 


Gly 


Lys Ala 


Phe 


Arg 


Gin 


Ser 


Ser 


Ala 


Leu IJ.© Gin His 


Gin 


35 










40 




45 


Arg 


Met 


His Thr 


Gly 


Glu 


Arg 


Pro 


Tyr 


Lys 


Cys Asn Glu Cys 


Asp 






50 










55 




60 


Lys 


Thr 


Phe Arg 


Cys 


Asn 


Ser 


Ser 


Leu 


Ser 


Asn His Gin Arg 


He 




65 










70 




75 


His 


Thr 


Gly Glu 


Lys 


Pro 


Tyr 


Arg 


Cys 


Leu 


Glu Cys Gly Met 


Ser 








80 










85 




. 90 


Phe 


Gly 


Gin Ser 


Ala 


Ala 


Leu 


He 


Gin 


His 


Gin Arg He His 


Thr- 






95 










100 




105 


Gly 


Glu 


Lys Pro 


Phe 


Lys 


Cys 


Asn 


Thr 


Cys 


Gly Lys Thr Phe 


Arg 








110 










115 




120 


Gin 


Ser 


Ser Ser 


Leu 


He 


Ala 


His 


Gin 


Arg 


He His Thr Gly Glu 








125 










13 0 




135 


Lys 


Pro 


Tyr Glu 


Cys 


Asn 


Ala 


Cys 


Gly 


Lys 


Leu Phe Ser Gin 


Arg 






140 










145 




150 


Ser 


Ser 


Leu Thr 


Asn 


His 


Tyr 


Lys 


He 


His 


He Glu Glu Asp 


Ser 








155 






160 




165 


Leu 


Lys 


Ala Asp 


Leu 


His 


Val 













170 

<210> 284 
<211> 151 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 800045 . 1 . orf 1 : 2000FEB01 
<220> 

<221> unsure 
<222> 104 

<223> unknown or other 
<400> 284 



Lys He Met His Thr 


Gly 


Glu 


Lys 


Arg 


Tyr Glu Cys Asp Asp 


Cys 


1 5 










10 


15 


Gly Gly Thr Phe Arg 


Ser 


Ser 


Ser 


ser 


Leu Arg Val His Lys 


Arg 


20 










25 


30 


He His Thr Gly Glu 


Lys 


Pro 


Tyr 


Lys 


Cys Glu Glu cys Gly Lys 


35 










40 


45 


Ala Tyr Met Ser Tyr 


Ser 


Ser 


Leu 


He 


Asn His Lys Ser Thr 


His 


50 










55 


60 
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Opr 


G5 v 


Gin TiAyc; 




C\ra T.\TG <"*ytc 2Vct> 
\~yo Lijfst v^jfo nop 




p\rc ClV IjVS Car PHo 












/ u 


/ D 




ryr 


i)ci Oei 


Val 


Leu Asp Gin His 




rvtry lie nio xiixr v^iy , 








fin 




ft c; 


on 


Pi i-i 


L»ys 


fio lyi 


VaJLU 


p»rpi 1 *r "1 »i pirn 

uys V3j.y biu Lys 




Jjys nla rlic Aaa ASH 








95 


100 


105 


Ser 


Ser 


Gly Leu 


Arg 


Val His Lys Arg 


lie 


His Thr Gly Glu Lys 








110 




115 


120 


Pro 


Tyr 


Glu Cys 


Asp 


lie Cys Gly Lys 


Thr 


Phe Ser Asn Ser Ser 








125 




130 


135 


Gly 


Leu 


Thr Val 


His 


Lys Arg lie His 


Thr 


Val Ser Asp Glu Leu 








140 


145 


150 


Pro 















<210> 285 

<211> 89 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 800680 . 1 .orf 2 :2000FEB01 
<400> 285 



Leu 


Thr 


Tyr Leu 


Arg 


Lys 


Lys Leu Arg Gly 


Arg 


Gly Lys Lys Glu 


1 






5 




. 10 




15 


Glu 


Glu 


Gly Met 


Ala 


Leu 


Ser Gin Gly Leu 


Phe 


Thr Phe Lys Asp 








20 




25 




30 


Val 


Ala 


lie Glu 


Phe 


Ser 


Gin Glu Glu Trp 


Glu 


Cys Leu Asp Pro 








35 




40 




45 


Ala 


Gin 


Arg Ala 


Leu 


Tyr 


Arg Asp Val Met 


Leu 


Glu Asn Tyr Arg 








50 




55 




60 


Asn 


Leu 


Leu Ser 


Leu 


Asp 


Glu Asp Asn lie 


Pro 


Pro Glu Asp Gly 








65 


70 




75 


Ser 


His 


Leu Ala 


Ala 


Cys 


Gly Gin Ser Thr 


Leu 


Pro Leu Pro 








80 


85 







<210> 286 
<211> 146 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 800894 . 1 .orf 2 :2000FEB01 
<400> 286 



Pro 


Ala Gly lie Gly 


Arg Ser Thr Thr 


Lys 


Ser 


Pro Gly Pro Pro 


1 


5 




10 




15 


Gly 


Ser Leu Glu Met 


Gly Ser Leu Thr 


Phe 


Arg 


Asp Val Ala He 




20 


25 




30 


Glu 


Phe Ser Leu Glu 


Glu Trp Gin Cys 


Leu 


Asp 


Thr Ala Gin Gin 




35 


40 




45 


Asn 


Leu Tyr Arg Asn 


Val Met Leu Glu 


Asn 


Tyr 


Arg Asn Leu Val 




50 




55 




60 


Phe 


Leu Gly lie Ala 


Ala Phe Lys Pro 


Asp 


Leu 


He He Phe Leu 




65 




70 




75 


Glu 


Glu Gly Lys Glu 


Ser Trp Asn Met 


Lys 


Arg 


His Glu Met Val 




80 




85 




90 


Glu 


Glu Ser Pro Val 


lie Cys Ser His 


Phe 


Ala 


Gin Asp Leu Trp 




95 




100 




105 


Pro 


Glu Gin Gly lie 


Glu Asp Ser Phe 


Gin 


Lys 


Val He Leu Arg 




110 


115 


120 


Arg 


Tyr Lys lie His 


His His Ala Cys 


Glu 


Leu 


Gly Pro He Met 




125 


130 




135 


Asn 


His Tyr Pro Thr 


Cys Gly Gin Met 


His 


lie 
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140 145 

<210> 287 

<211> 78 1 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> mis cofeature 

<223> Incyte ID NO: LI : 801015 . 1 . orfl :2000FEB01 
<400> 287 



Gly 


Ser 


Arg 


Lys 


Met 


Asp 


Ser Val 


Ala 


Phe Glu Asp Val Ala 


Val 


1 






5 








10 


15 


Asn 


Phe 


Thr 


Gin 


Glu 


Glu 


Trp 


Ala 


Leu 


Leu Asp Pro Trp Gin 


Lys 










20 










25 


30 


Lys 


Leu Tyr 


Arg 


Asp 


Val 


Met 


Leu 


Glu 


Thr Tyr Arg Asn Leu 


Ala 








35 










1 40 


45 


Ser 


Val 


Gly 


Asp 


Asp 


Asp 


Asn 


He 


Pro 


Ser Leu Arg Glu Gin 


Val 








50 










55 


60 


Ala 


His 


Gin 


Arg 


Tyr 


Phe 


Lys 


Thr 


Trp 


His Val Glu Arg Glu 


Tyr 








65 








70 


75 


Phe 


Ser 


Lys 



















<210> 288 
<211> 126 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No : LI : 801236 . 1 .orf 3 : 2000FEB01 
<220> 

<221> unsure 
<222> 4 

<223> unknown or other 
<400> 288 



Met 


Trp 


Glu 


Xaa 


Phe 


Ser 


His 


Thr 


Pro 


Ala 


Phe 


He 


Gin His 


Gin 


1 






5 










10 








15 


Arg 


He 


His 


Thr 


Gly 


Glu 


Lys 


Pro 


Tyr 


Glu 


Cys 


Asn 


Ala Cys 


Gly 










20 










25 








30 


Lys 


Ala 


Phe 


Asn 


Arg 


Ser 


Ala 


His 


Leu 


Thr 


Glu 


His 


Gin Arg 


Thr 








35 










40 








45 


His 


Thr 


Gly Glu 


Lys 


Pro 


Tyr 


Val 


Cys 


Lys 


GlU 


Cys Gly Lys 


Thr 










50 










55 








60 


Phe 


Ser 


Arg 


Ser 


Thr 


His 


Leu 


Thr 


Glu 


His 


Leu 


Lys 


He His 


Ser 








65 










70 








75 


Cys 


Val 


Lys 


Pro 


Tyr 


Gin 


Cys 


Asn 


GlU 


Cys 


Gin 


Lys 


Leu Phe 


Cys 










80 










85 








90 


Tyr 


Arg 


Thr 


Ser 


Leu 


He 


Arg 


His 


Gin 


Arg 


Thr 


His 


Thr Gly 


Glu 










95 










100 








105 


Lys 


Pro 


Tyr 


Gin 


Cys 


Asn 


Glu 


Cys 


Gly 


Lys 


Ser 


Phe 


Ser Leu 


Ser 








110 








115 








120 


Ser 


Ala 


Leu 


Thr 


Lys 


His 



















125 

<210> 289 

<211> 96 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 803335 . 1 . orf 1 : 2000FEB01 



149/228 



BNSDOCID: <WO 01 62927A2_I_> 



WO 01/62927 



PCT/US01/06059 



<400> 289 



Ser 


Ala 


Ala 


Met Phe 


Pro Val Phe Ser Gly 


Cys 


Phe Gin Glu 


Leu 


1 






5 


10 




15 


Gin 


Glu 


Lys 


Asn Lys 


Ser Leu Glu Leu Val 


Ser 


Phe Glu Glu 


Val 








20 


25 






30 


Ala 


Val 


His 


Phe Thr 


Trp Glu Glu Trp Gin 


Asp 


Leu Asp Asp 


Ala 






Thr 


35 


40 






45 


Gin 


Arg 


Leu ,Tyr 


Arg Asp Val Met, Leu 


Glu 


Thr Tyr Ser 


Ser 








' , 50 


55 






60 


Leu 


Val 


Ser 


Leu- Gly 


His cys lie Thr Lys 


Pro 


Glu Met He 


Phe 








65 


70 






75 


Lys 


Leu 


Glu 


Gin Gly 


Ala Glu Pro Trp lie 


Val 


Glu Glu Thr 


Leu 








80 


85 






90 


Asn 


Leu 


Arg 


Leu Ser 


Gly ' 














95 











<210> 290 
<211> 149 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 803998 . l'.orfl :2000FEB01 
<400> 290 



Lys 


Asn 


Ser 


Tyr Trp 


Arg 


Lys 


Asn Pro 


Thr 


Asn 


Met Lys Asn 


Val 


1 






5 








10 






15 


Ala 


Lys 


Leu 


Leu He 


Asn 


Ser 


Gin Arg 


Leu 


Leu 


Asn He Arg 


Glu 








20 






25 






30 


Phe 


Val 


Gin 


Glu Gly 


Asn 


Pro 


Thr Asn 


Leu 


Lys 


Asn Val Ala 


Ser 








35 








40 




45 


Leu 


Leu 


Ala 


He Pro 
50 


Gin 


Ser 


Leu Leu 


Asn 
55 


He 


His Val He 


His 
60 


Thr 


Gly 


Gly 


Asn Ser 
65 


Tyr 


Asn 


Cys Val 


Glu 
70 


Cys 


Cys Asn Ala 


Leu 
75 


Asn 


Gin 


Ser 


Leu Arg 

' : so 


Leu 


Thr 


Thr Tyr 


Lys 


Thr 


Thr His Thr 


Gly 












85 






90 


Glu 


Lys 


Pro 


Cys Met 
95 


Cys 


Glu 


Glu, Cys 


Gly 
100 


Lys 


Ala Ser Asn 


Arg 
105 


Ser 


Ser 


He 


Leu Lys 


Arg 


His 


Lys Leu 


lie 


His 


Thr Gin Glu 


Arg 








110 






115 






120 


Leu 


Tyr 


Lys 


Pro Glu 
125 


Arg 


Cys 


Asp Asn 


Ala 
130 


Phe 


Gly Asn Thr 


Ser 
135 


Asp 


Phe 


Ser 


Glu Tyr 
140 


Lys 


Arg 


Asn Arg 


Thr 
145 


Asp 


Glu Lys Ser 





<210> 291 
<211> 134 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 478757 . 1 . orf 2 :2000FEB01 



<400> 291 

Trp Trp Glu He Cys Ala His Ser Asp Val Ala Ala Glu Glu Gly 
15 10 15 

Lys Ala Arg Arg Ser Arg Gin His Arg Phe Leu Gly Thr Cys Glu 

20 25 30 

Gly He Met Arg Arg Ala Glu Leu Ser Ser Gin Val Glu Asp Ser 

35 40 45 

Thr Leu His Ala Trp He Arg Tyr Ser Leu Val Leu Asp Val Asp 

50 55 60 

Cys Trp His He Ala Ala Gin Leu Glu Met Tyr Gly Cys Pro His 

65 70 75 



150/228 



BNSDOCID: <WO 0162927A2J_> 
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Leu Asp Leu Thr Glu Ser Arg Gly Ala Ala Ala Arg Lys Leu His * 
80 85 90' 

Leu Leu Gly Phe Ser Ala Leu Pro Thr Leu Val Asp Met lie Thr 
95 1 100 105 

Ser Gin Gly Ser Val Ser Phe Arg Asp Val Thr Met Gly Phe Thr 
110 115 120 

Gin ' Glu Glu Trp His His Leu Asp Pro Ala Gin Arg Thr Leu 
125 130 



<210> 292 
<211> 212 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: LI : 808532 . 1 -orf 2 :2000FEB01 
<400> 292 



His 


Asn 


Phe 


Gin 


Leu 


Gin. 


Lvs 


His 


His 


Arg 


He 


His 


Thr 


Gly Glu 


1 








5 








10 








15 


Lys 


Pro 


Phe 


Lys 


Cys 
20 


Glu 


lie 

• 


Cys 


Gly 


Lys 
25 


Ser 


Phe 


Cys 


Leu Arg 
30 


Ser 


Ser 


Leu 


Asn Arg 


His 


Cys 


Met 


Val 


His 


Thr 


Ala 


Glu 


Lys Leu 










35 








40 








45 


Tyr 


Lys 


Ser 


Glu 


Lys 


Tyr 


Gly 


Arg Gly 


Phe 


He 


Asp 


Arg 


Leu Asp 








50 










55 








60 


Leu 


His 


Lys 


His 


Gin 


Met 


lie 


His 


Met 


Gly 


Gin 


Lys 


Pro 


Tyr Asn 








65 










70 








75 


Cys 


Lys 


GlU 


Cys 


Gly 
80 


Lys 


Ser 


Phe 


Lys 


Trp 
85 


Ser 


Ser 


Tyr 


Leu Leu 
90 


Val 


His 


Gin 


Arg Val 


His 


Thr 


Gly Glu 


Lys 


Pro 


Tyr 


Lys 


Cys Glu 










95 










100 








105 


Glu 


Cys 


Gly 


Lys 


Gly 
110 


Tyr 


lie 


Ser 


Lys 


Ser 
• 115 


Gly 


Leu 


Asp 


Phe His 
120 


His 


Arg 


Tlir 


His 


Thr 


Gly 


Glu 


Arg 


Ser 


Tyr 


Asn 


Cys 


Asp Asn Cys 








125 








130 








135 


Gly 


Lys 


Ser 


Phe 


Arg 
140 


His 


Ala 


Ser 


Ser 


He 
145 


Leu 


Asn 


His 


Lys Lys 
150 


Leu 


His 


Cys 


Gin Arg 


Lys 


Pro 


Leu 


Lys 


Cys 


Glu 


Asp 


Cys 


Gly Lys 










155 










160 








165 


Arg 


Leu 


Val 


Cys 


Arg 


Ser 


Tyr 


Cys 


Lys 


Asp 


Gin 


Gin 


Arg Asp His 








170 








175 








180 


Ser 


Gly 


Glu 


Asn 


Pro 


Ser 


Lys 


Cys 


Glu 


Asp 


Cys 


Gly Lys Arg Tyr 










185 










190 








195 


Lys 


Arg 


Arg 


Leu 


Asn 


Leu 


Asp 


He 


He 


Leu 


Ser 


Leu 


Phe 


Leu Asn 



200 205 ' 210 

Asp He 



<210> 293 
<211> 108 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> mi sc_f eature 

<223> Incyte ID No: LI : 443073 . 1 . orf 2 : 2000FEB01 
<400> 293 

Lys Pro Tyr Met Cys Lys Glu Cys Arg Lys Thr Phe Ser Gin Asn 
1 5 10 15 

Ala Gly Leu Ala Gin His Gin Arg He His Thr Gly Glu Lys Pro 

20 25 30 

Tyr Glu Cys Asn Val cys Gly Lys Ala Phe Ser Tyr Ser Gly Ser 

35 ~ 40 45 

Leu Thr Leu His Gin Arg He His Thr Gly <51u Arg Pro Tyr Glu 



151/228 
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50 


55 


60 


Cys 


Lys Asp 


Cys 


Arg 


Lys Ser Phe Arg Gin Arg Ala His Leu 


Ala 








65 


70 


75 


His 


His Glu 


Arg 


He 


His Thr Met Glu Ser Phe Leu Thr Leu 


Ser 








80 


8. 5 


90 


Ser 


Pro Ser 


Pro 


Ser 


Thr Ser Asn Gin Leu Pro Arg Pro Val 


Gly 




i 




95 


100 


105 


Phe 


lie Ser 











<210> 294 

<211> 83 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte. ID No: LI : 47 9 671. 1 .orf 1:2000FEB01 
<400> 294 



Pro 


Ala 


Cys Thr 


Gly 


Gly 


Phe Ala Gly 


Arg' Met Ser Gly His 


Pro 


1 






. 5 






10 


15 


Gly 


Ser 


Trp Glu 


Met 


Asn 


Ser Val' Ala 


Phe Glu Asp Val Ala 


Val 








20 






25 


30 


Asn 


Phe 


Thr Gin 


G'lu 


Glu 


Trp Ala Leu 


Leu Asp pro Ser Gin 


Lys 








35 




40 


45 


Asn 


Leu 


Tyr Arg 


Asp 


Val 


Met Gin Glu 


Thr Phe Arg Asn Leu 


Ala 








50 






55 


60 


Ser 


He 


Gly Asn 


Lys 


Gly 


Glu Asp Gin 


Ser He Glu Asp Gin 


Tyr 








65 






70 


75 


Lys 


Asn 


Ser Ser 


Arg 


Asn 


Leu Arg 







80 

<210> 295 
<211> 180 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI.-810078. 1 . orf 1 : 2000FEB01 
<400> 295 



Pro 


Tyr 


Val Cys Lys 


Glu Cys 


Gly Lys 


Ala 


Phe Thr Gin Tyr 


Ser 


1 




5 






10 




15 


Gly 


Leu 


Ser Met His 


Val Arg 


Ser His 


Ser 


Gly Asp Lys Pro 


Tyr 






20 






25 




30 


Glu 


Cys 


Lys Glu Cys 


Gly Lys 


Ser Phe 


Leu 


Thr Ser Ser Arg 


Leu 






35 






40 




45 


He 


Gin 


His He Arg 


Thr His 


Thr Gly 


Glu 


Lys Pro Phe Val 


Cys 






50 






55 




60 


Val 


Glu 


Cys Gly Lys 


Ala Phe 


Ala Val 


Ser 


Ser Asn Leu Ser 


Gly 






65 






70 




75 


His 


Leu 


Arg Thr His 


Thr Glu 


Glu Lys 


Ala 


Cys Glu Cys Lys 


He 






80 






85 




90 


Cys 


Gly 


Lys Val Phe 


Gly Tyr 


Pro Ser 


Cys 


Leu Asn Asn His 


Met 






95 






100 




105 


Arg 


Thr 


His Ser ,Ala 


Gin Lys 


Pro Tyr 


Thr 


Cys Lys Glu Cys 


Gly 






110 






115 




120 


Lys 


Ala 


Phe Asn Tyr 


Ser Thr 


His Leu 


Lys 


He His Met Arg 


He 






125 






130 




135 


His 


Thr 


Gly Glu Lys 


Pro Tyr 


Glu Cys 


Lys 


Gin Cys Gly Lys 


Ala 






140 






145 




150 


Phe 


Ser 


His Ser Ser 


Ser Phe 


Gin He 


His 


Glu Arg Thr His 


Thr 






155 






160 


165 


Gly 


Glu 


Lys Pro Tyr 


Glu Cys 


Lys Glu 


Cys 


Gly Lys Ala Phe 


Thr 






170 






175 




180 
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<210> 296 

<211> 97 

<212> PRT 

<213> Homo sapiens 

<220> ' 
<221> mis cofeature 

<223> Incyte ID No: LI : 810224 . 1 . orf 3 : 2000FEB01 
<400> 296 



Leu 


Cys 


He 


Ser 


Lys 


His 


Pro 


Gly Glu 


Lys 


Pro 


Tyr 


Glu 


Cys 


Tyr 


1 








5 










10 










15 


Ala 


Cys 


Gly Asn 


Thr 


Phe 


Leu 


Arg 


Lys 


Ser 


Asp 


Leu 


He 


Lys 


His 










20 










25 










30 


Gin 


Arg 


He 


His 


Thr 
35 


Gly 


Glu 


Lys 


Pro 


Tyr 
40 


Glu 


Cys 


Asn 


Glu 


Cys 
45 


Gly 


Lys 


Ser 


Phe 


Ser 
50 


Glu 


Lys 


Ser 


Thr 


Leu 
55' 


Thr 


Lys 


His 


Leu 


Arg 
60 


Thr 


His 


Arg 


Trp 


Glu 


He 


Leu 


Cys 


Met 


Tyr 


Ser 


Met 


Trp 


Lys 


He 








65 










70 










75 


Phe 


Leu 


Leu 


Leu 


Leu 


Gin 


Phe 


His 


Arg 


Thr 


Ser 


Glu 


Lys 


Pro 


His 










•80 








85 








90 


Arg 


Gly 


Glu 


Thr 


Phe 


Trp 


Met 



















95 

<210> 297 
<211> 217 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LI : 817052 . 2 . orf 1 : 2000FEB01 
<400> 297 



Ala 


Ala 


Pro 


Glu 


Ser 


Gin 


Gin 


Gin Arg 


Asn 


Arg Arg Gly Glu 


Arg 


1 








5 








10 




15 


Pro 


Phe 


Thr 


Cys 


Met 


Glu 


Cys 


Gly Lys 


Ser 


Phe Arg Leu Lys 


He 










20 








25 




30 


Asn 


Leu 


He 


He 


His 


Gin Arg 


Asn His 


He 


Lys Glu Gly Pro 


Tyr 










35 








40 




45 


GlU 


Cys 


Ala 


Glu 


Cys 


Glu 


He 


Ser Phe 


Arg 


His Lys Gin Gin 


Leu 








50 








55 


60 


Thr 


Leu 


His 


Gin 


Arg 


He 


His 


Arg Val 


Arg 


Gly Gly Cys Val 


Ser 










65 








70 




75 


Pro 


Glu 


Arg 


Gly 


Pro 


Thr 


Phe 


Asn Pro 


Lys 


His Ala Leu Lys 


Pro 






80 








85 




90 


Arg 


Pro 


Lys 


Ser 


Pro 


Ser 


Ser 


Gly Ser 


Gly 


Gly Gly -Gly Pro 


Lys 










95 








100 




105 


Pro 


Tyr 


Lys 


Cys 


Pro 


Glu 


Cys 


Asp Ser 


Ser 


Phe Ser His Lys 


Ser 










110 








115 




120 


Ser 


Leu 


Thr 


Lys 


His 


Gin 


He 


Thr His 


Thr 


Gly Glu Arg Pro 


Tyr 








125 








130 




135 


Thr 


Cys 


Pro 


Glu 


Cys 


Lys 


Lys 


Ser Phe 


Arg 


Leu His He Ser 


Leu 










140 








145 




150 


Val 


He 


His 


Gin 


Arg 


Val 


His 


Ala Gly 


Lys 


His Glu Val Ser 


Phe 










155 






160 




165 


He 


Cys 


Ser 


Leu 


Cys 


Gly Lys 


Ser Phe 


ser 


Arg Pro Ser His 


Leu 










170 








175 




180 


Leu 


Arg 


His 


Gin Arg 


Thr 


His 


Thr Gly 


Glu 


Arg Pro Phe Lys 


Cys 










185 






190 


195 


Pro 


Glu 


Cys 


Glu 


Lys 


Ser 


Phe 


Ser Glu 


Lys 


Ser Lys Leu Thr 


Asn 










200 








205 


210 


His 


cys 


Arg 


Val 


His 


Ser 


Arg 











215 
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<210> 298 
<211> 137 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 892274 . 1 .orf 3 :2000MAY19 
<400> 298 



Asp 




vaiy 


Leu Asp 


Leu Gly 


Pro Thr Asn 


Ser 


uiu L>iy lie Pro 


1 








5 




10 




15 


Ser 


Pro 


Asp 


Leu 


Asn 


Pro Val 


Leu Gly Met 


Gly 


Ser Trp Arg His 










20 




25 


30 


He 


Asp 


Ser 


He 


Thr 


Pro Gly 


Thr Pro Gly 


Ser 


Ala Gly Leu Asp 










35 




40 




45 


Leu 


Pro 


Ala 


Arg 


Glu 


Arg He 


Thr Leu Val 


Gly 


Gly Asp Lys Pro 










50 




55 




60 


He 


Lys 


Val 


Pro 


Thr 


Gly He 


Trp Gly Thr 


Ser 


Pro Ala Gly Tyr 










65 




' 70 




75 


Met 


Gly 


Leu 


He 


Leu 


Gly Lys 


Ser Arg Leu 


Asn 


Leu Gin Gly Met 










80 




85 




90 


Thr 


Val 


Val 


Pro 


Gly 


Ala Val 


Asp Ser Asp 


Tyr 


Glu Gly Glu Thr 










95 




100 




105 


Gin 


Val 


Val 


Leu 


Met 


Ser Gin 


Asp Leu Trp 


Val 


Phe Glu Leu Gly 


Glu 








110 




115 




120 


Tyr 


He 


Ala 


Gin 


Leu Leu 


Leu He Pro 


cys 


Lys Leu His Pro 










125 




130 


135 


Ser 


Pro 

















<210> 299 
<211> 169 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 1080959 . 1 . orf 2 : 2000MAY19 
<400> 299 



Pro 


Lys 


Gin Gly 


He 


Asn Val Trp Ser 


Pro 


Arg His Pro Glu Asn 


1 






5 


10 


15 


Phe 


Leu 


Gly He 


Glu 


Ser Arg Pro Pro 


Met 


Leu Ser Leu Ser Pro 








20 




25 


30 


He 


Leu 


Leu Tyr 


Thr 


Cys Glu Met Phe 


Gin 


Asp Pro Val Ala Phe 








35 




40 


45 


Lys 


Asp 


Val Ala 


Val 


Asn Phe Thr Gin 


Glu 


Glu Trp Ala Leu Leu 








50 




55 


60 


Asp 


He 


Ser Gin 


Lys 


Asn Leu Tyr Arg 


GlU 


Val Met Leu Glu Thr 








65 




70 


75 


Phe 


Trp 


Asn Leu 


Thr 


Ser He Gly Lys 


Lys 


Trp Lys Asp Gin Asn 








80 




85 


90 


He 


Glu 


Tyr Glu 


Tyr 


Gin Asn Pro Arg 


Arg 


Asn Phe Arg Ser Val 








95 




100 


105 


Thr 


Glu 


Glu Lys 


Val 


Asn Glu He Lys 


Glu 


Asp Ser His Cys Gly 








110 




115 


120 


Glu 


Thr 


Phe Thr 


Pro 


Val Pro Asp Asp 


Arg 


Leu Asn Phe Gin Lys 








125 




130 


135 


Lys 


Lys 


Ala Ser 


Pro 


Glu Val Lys Ser 


Cys 


Asp Ser Phe Val Cys 








140 




145 


150 


GlU 


Val 


Gly Leu Gly 


Asn Ser Ser Ser 


Asn 


Met Asn He Arg Gly 








155 




160 


165 


Asp 


Thr 


Gly His 











<210> 300 



154/228 



BNSDOCID: <WO 0162927A2_L> 



WO 01/62927 

<211> 135 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> mis cofeature 
<223> Incyte ID No: 

<400> 300 



Asp 


Ala 


Trp 


Ala 


Arg 


Pro 


Pro 


Val 


Leu 


Ser 


Leu 


Ser Pro He Leu 


1 






5 










10 




15 


Leu 


Tyr 


Thr 


Cys 


GlU 


Met 


Phe 


Gin 


Asp 


Pro 


Val 


Ala Phe Asp Asp 








20 










25 




30 


Val 


Ala 


Val 


Asn 


Phe 


Thr 


Gin 


Glu 


Glu 


Trp 


Ala 


Leu Leu Asp He 










35 










40 




45 


Ser 


Gin 


Arg 


Lys 


Leu 


Tyr 


Lys 


Glu 


Val 


'Met 


Leu 


Glu Thr. Phe Arg 






50 






. 55 




60 


Asn 


Leu 


Thr 


Ser 


Val 


Gly 


Lys 


Ser 


Trp 


Lys 


Asp 


Gin Asn He Glu 










65 








70 




75 


Tyr 


Glu 


Tyr 


Gin 


Asn 


Pro 


Arg 


Arg 


Asn 


Phe 


Arg 


Ser Leu He Glu 








80 








85 




90 


Lys 


Lys 


Val 


Asn 


Glu 


He 


Lys 


Asp 


Asp 


Ser 


His 


Cys Gly Glu Thr 










95 










100 




105 


Phe 


Thr 


Gin 


Val 


Pro 


Asp 


Asp 


Arg 


Leu 


Asn 


Phe 


Gin Glu Lys Lys 










110 






115. 




120 


Ala 


Ser 


Pro 


Glu 


He 


Lys 


Ser 


Cys 


Asp 


Ser 


Phe 


Val Cys Gly Lys 



125 13 0 135 



<210> 301 
<211> 170 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LG : 1077357 . 1 . orfl : 2000MAY19 
<400> 301 



Thr 


Val 


Met 


Leu 


Cys 


Asp 


Glu Glu 


Ala 


Gin 


Lys 


Arg Lys 


Ala 


Lys 


1 








5 






10 








15 


GlU 


Ser 


Gly Met 


Ala 


Leu 


Pro Gin Gly 


Arg 


Leu 


Thr Phe 


Met 


Asp 










20 








25 








30 


Val 


Ala 


He 


Glu 


Phe 


Ser 


Gin Glu 


Glu 


Trp 


Lys 


Ser Leu Asp 


Pro 










35 








40 








45 


Gly 


Gin 


Arg Ala 


Leu 


Tyr 


Arg Asp 


Val 


Met 


Leu 


Glu Asn 


Tyr 


Arg 










50 








55 




i 




60 


Asn 


Leu 


Val 


Phe 


Leu 


Gly 


He Cys 


Leu 


Pro 


Asp 


Leu Ser 


He 


He 










65 




70 








75 


Ser 


Met 


Leu 


Lys 


Gin 


Arg 


Arg Glu 


Pro 


Leu 


lie 


Leu Gin 


Ser 


Gin 








80 




85 








90 


Val 


Lys 


He 


Val 


Lys 


Asn 


Thr Asp 


Gly 


Arg 


Glu Cys Val Arg 


Ser 








95 








100 








105 


Val 


Asn 


Thr 


Gly 


Arg 


Ser 


Cys Val 


Leu 


Gly 


Ser 


Asn Ala 


Glu 


Asn 










110 






115 








120 


Lys 


Pro 


He 


Lys 


Asn 


Gin 


Leu Gly 


Leu 


Thr 


Leu 


Glu Ser 


His 


Leu 










125 








130 








135 


Ser 


Glu 


Leu 


Gin 


Leu 


Phe 


Gin Ala Gly 


Arg 


Lys 


He Tyr 


Arg 


Ser 










140 








145 








150 


Asn 


Pro 


Val 


Glu 


Lys 


Phe 


Thr Asn 


His 


Arg 


Ser 


Ser Val 


Ser 


Pro 










155 








160 








165 


Leu 


Gin 


Lys 


He 


Ser 
























170 



















<210> 302 
<211> 181 
<212> PRT 



155/228 
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LG : 1054900. l.orf 3 :2000MAY19 



BNSDOCID: <WO 0162927A2_I_> 



WO 01/62927 

<213> Homo sap. 



PCTAJS01/06059 



<220> 

<221> misc_feature , 

<223> Incyte ID No: LG: 1084051 . 1 .orf3 :2000MAY19 
<400> 302 



Thr 


Ser 


Tvr 


He Arg 


Thr Lys Thr Tyr Glu Cys 


Asn He Cys Glu 


1 








5 




10 


15 


Lys 


He 


Phe 


Lys. Gin 


Pro He His Leu 


Thr Glu 


His Met Arcr Tie 










20 




25 


30 


His 




Glv 


Glu Lys 


Pro Phe Arg Cys 


Lys Glu 












35 




40 


' ' 45 


Phe 


Ser 


Gin 


Ser 


Ala 


Ser Leu Ser Thr 


His Gin 


Arcr Tl & Hi s Thr 

ni vj ij,c mo xxix 










50 




55 


60 


Gly 


Glu 


Lys 


Pro 


Phe 


Glu cys Glu Glu cys Gly 












65 




70 


75 


His 


Arg 


ser 


Ser 


Leu 


Asn Gin His His 


Arg Thr 


His Thr Gly Glu 










80 




85 


90 


Lys 


Pro 


Tyr 


Val 


Cys 


Asp Lys Cys Gin 


Lys Ala 


Phe Ser Gin Asn 










95 




100' 


105 


He 


Ser 


Leu 


Val 


Gin 


His Leu Arg Thr 


His Ser 


Gly Glu Lys Pro 










110. 


115 


120 


Phe 


Thr 


Cys 


Asn 


Glu 


Cys Gly Lys- Thr 


Phe Arg 


Gin He Arg His 










125 


Arg He His Thr 


130 


135 


Leu 


Ser 


Glu 


His 


He 


Gly Glu 


Lys Pro Tyr Ala 










140 




145 


150 


Cys 


Thr 


Ala 


Cys 


Cys 


Lys Thr Phe Ser 


His Arg 


Ala Tyr Leu Thr 










155 




160 


165 


His 


His 


Gin 


Arg 


Ser 


He Leu Gly Arg 


Asp Leu 


Gin Cys Lys Glu 










170 




175 


180 


Cys 

















<210> 303 
<211> 263 
<212> PRT 
<213> Homo sapieps 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 1076853 . 1 .orfl :2000MAY19 
<400> 303 



Ala 


Phe 


Ser 


Arg 


cys 


Ser 


Ser 


Leu 


Val 


Gin 


His 


Glu Arg Thr His 


1 








5 










10 




15 


Thr 


Gly 


Glu 


Lys 


Pro 


Phe 


Glu 


Cys 


Ser 


He 


Cys 


Gly Arg Ala Phe 










20 










25 


30 


Gly 


Gin 


Ser 


Pro 


Ser 


Leu 


Tyr 


Lys 


His 


Met 


Arg 


He His Lys Arg 










35 










40 




45 


Gly 


Lys 


Pro 


Tyr 


Gin 


Ser 


Ser 


Asn 


Tyr 


Ser 


He 


Asp Phe Lys His 










50 










55 




60 


Ser 


Thr 


Ser 


Leu 


Thr 


Gin 


Asp 


Glu 


Ser 


Thr 


Leu 


Thr Glu Val Lys 










65 










70 




75 


Ser 


Tyr 


His 


Cys 


Asn 


Asp 


Cys 


Gly 


Glu 


Asp 


Phe 


Ser His He Thr 










80 










85 




90 


Asp 


Phe 


Thr 


Asp 


His 


Gin 


Arg 


He 


His 


Thr 


Ala 


Glu Asn Pro Tyr 










95 










100 




105 


Asp 


Cys 


Glu 


Gin 


Ala 


Phe 


Ser 


Gin 


Gin 


Ala 


He 


Ser His Pro Gly 










110 










115 




120 


Glu 


Lys 


Pro 


Tyr 


Gin 


Cys 


Asn 


Val 


Cys 


Gly 


Lys 


Ala Phe Lys Arg 










125 










13 0 




135 


Ser 


Thr 


Ser 


Phe 


He 


Glu 


His 


His 


Arg 


He 


His 


Thr Gly Glu Lys 










140 








145 




150 


Pro 


Tyr. Glu 


Cys 


Asn 


Glu 


Cys 


Gly 


Glu 


Ala 


Phe 


Ser Arg Arg Ser 










155 








160 




165 


Ser 


Leu 


Thr 


Gin 


His 


GlU 


Arg 


Thr 


His 


Thr 


Gly 


Glu Lys Pro Tyr 



156/228 
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170 


175 










180 


GlU 


Cvs 


He 


Asp 


Cys, 


Gly Lys Ala Phe Ser 


Gin 


Ser 


Ser 


Ser 


Leu 








185 


190 










195 


He 


Gin 


His 


GlU 


Arg 
200 


Thr His Thr Gly Glu 
205 


Lys 


Pro 


Tyr 


GlU 


Cys 
21,0 


Asn 


Glu 


Cys 


Gly 


Arg 
215 


Ala Phe Arg Lys Lys 
220 


Thr 


Asn 


Leu 


His 


Asp 
225 


His 


Gin 


Arg 


lie. 


His 
230 


Thr Gly Glu Lys Pro 
235 


Tyr 


Ser 


Cys 


Lys 


Glu 
240 


Cys 


Gly 


Lys 


Asn 


Phe 


Ser Arg Ser Ser Ala 


Leu 


Thr 


Lys 


His 


Gin 






245 


250 






> 




255 


Arg 


He 


His 


Thr 


Arg 
260 


Asn Lys Leu 













<210> 304 
<211> 340 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_f eature 

<223> Incyte ID No: LG: 481631 . 10 . orf 3 : 2000MAY19 

i , 

<400> 304 



Arg 


Leu 


Leu 


VaJ. 




vjIU 




P 1 ! ii 


1 LIS. 


bill 


Lys Arg 


Lys Arg 




1 








q 










1 0 






15 




Lys 




del 


m vr 
vvx y 




Ala 


ucu 


Oar 




Gly Leu 


Leu Thr 


Phe 








20 










25 






30 


A TCT 

/vl y ^ 


A en 
Asp 


v ax 


Al » 


Tie 


Glu 


Phe 


Ser 


Gin 


GlU 


Glu Trp 


Lys Cys 


Leu 






35 










40 






45 


A en 




Al a 


Gin 


Arg 


Thr 


Leu 




Arg 


Asp 


Val Met 


Leu Glu 


Asn 








50 










55 






60 




Arg 


Asn 


Leu 


Val 


Ser 


Leu 


Asp 


He 


Ser 


Ser Lys 


Cys Thr 


Met 








65 










70 






75 


Lys 


GlU 


Phe 


Leu 


Ser 


Thr 


Ala 


Gin 


Glv 


Asn 


Arg Glu Val Phe 


His 








,80 










85 






90 


Ala 


Gly 


Thr 


Leu 


Gin 


He 


His 


Glu 


Ser 


His 


His Asn Gly Asp 


Phe 








95 










100 






105 


Cys 


Tyr 


Gin 


Asp 


Val 
110 


Asp 


Lys 


Asp 


He 


His 
115 


Asp Tyr 


Glu Phe 


Gin 
120 


Trp 


Gin 


Glu 


Asp 


Glu 


Arg 


Asn 


Gly 


His 


Glu 


Ala Pro 


Met Thr 


Lys 






125 










13 0 






135 


He 


Lys 


Lys 


Leu 


Thr 


Gly 


He 


Thr 


GlU 


Arg 


Tyr Asp 


Gin Ser 


His 






140 










145 






150 


Ala 


Arg 


Asn 


Lys 


Pro 


He 


Lys 


Asp 


Gin 


Leu 


Gly Ser 


Ser Phe 


His 








155 










160 






165 


Ser 


His 


Leu 


Pro 


Glu 
170 


Met 


His 


He 


Phe 


Gin 
175 


Thr Glu 


Glu Lys 


He 
180 


Asp 


Asn 


Gin 


Val 


Val 


Lys 


Ser 


He 


His 


Asp 


Ala Ser 


Leu Val 


Ser 








185 








190 






195 


Thr 


Ala 


Gin 


Arg 


He 


Ser 


Cys 


Arg 


Pro 


Lys 


Thr His 


He Ser 


Asn 








200 








205 






210 


Asn 


His 


Gly 


Asn 


Asn 


Phe 


Trp 


Asn 


Ser 


Ser 


Leu Leu 


Thr Gin 


Lys 








215 








220 






225 


Gin 


Glu 


Val 


His 


Met 


Arg 


Glu 


Lys 


Ser 


Phe 


Gin Cys 


Asn Glu 


Ser 










230 








235 






240 


Gly 


Lys 


Ala 


Phe 


Asn 


Tyr 


Ser 


Ser 


Leu 


Leu 


Arg Lys 


His Gin 


He 






245 








250 






255 


He 


His 


Leu 


Ala 


Asp 
260 


Lys 


Tyr 


Lys 


Cys 


Asp 
265 


Val Cys 


Gly Lys 


Leu 
270 


Phe 


Asn 


Gin 


Lys 


Arg 


Asn 


Leu 


Ala 


Cys 


His 


Arg Arg Cys His 


Thr 










275 










280 






285 


Gly 


GlU 


Asn 


Pro 


Tyr 
290 


Lys 


Cys 


Asn 


Glu 


Cys 
295 


Gly Lys 


Thr Phe 


Ser 
300 


Gin 


Thr 


Ser 


Ser 


Leu 


Thr 


Cys 


His 


Arg 


Arg 


Leu His, 


Thr Gly Glu 










305 






310 






315 


Lys 


Pro 


Tyr 


Lys 


Cys 


Glu 


Glu 


Cys 


Asp 


Lys 


Ala Phe 


His Phe 


Lys 
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320 325 330 

Ser lie Leu Glu Arg His Arg He He His 
335 340 

i 

<210> 305 

<211> 89 ' 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 1088431 .2 . orfl ;2000MAY19 
<400> 305 

Leu Thr Tyr Leu Arg Lys Lys Leu Arg Gly Arg Gly Lys Lys Glu 
1 5 10 15 

Glu Glu Gly Met Ala Leu Ser Gin Gly Leu Phe Thr Phe Lys Asp 

20 25 30 

Val Ala He Glu Phe Ser Gin Glu Glu Trp Glu Cys Leu Asp Pro 

35 ' 40 45 

Ala Gin Arg Ala Leu Tyr Arg Asp Val Met Leu Glu Asn Tyr Arg 

50 55 60 

Asn Leu Leu Ser Leu Asp Glu Asp Asn He Pro Pro Glu Asp Gly 

65 70 75 

Ser His Leu Ala Ala Cys Gly Gin Ser Thr Leu Pro Leu Pro 

80 85 

<210> 306 

<211> 80 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LI : 401619 . 10 .orfl : 2000MAY01 
<400> 306 

Ala Leu His Pro Leu Pro Lys Arg Gin Ala Leu Glu Lys Ser Asn 
1 5 10 15 

Gly Thr Ser Ala Val Phe Asn Pro Ser Val Leu His Tyr Gin Gin 

20 25 30 

Ala Leu Thr Ser Ala Gin Leu Gin Gin His Ala Ala Phe He Pro 

35 40 45 

Thr Gly Met Cys Pro Tyr Cys Pro Thr Ser Cys Ala Leu Leu Val 

50 55 60 

Met Cys Phe Leu Leu He Ser Leu Ser Cys Leu Val Ala Ser Ser 

65 70 75 

Leu Leu Leu Lys Val 

80 

<210> 307 
<211> 386 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 1144007 . 1 . orf 2 : 2000MAY01 
<400> 307 

Ala Leu Asp Gly Val Arg Pro Pro Gin Ser Gin Glu Met Ala Thr 
15 10 15 

Ala Val Glu Pro Glu Asp Gin Asp Leu Trp Glu Glu Glu Gly He 

20 25 30 

Leu Met Val Lys Leu Glu Asp Asp Phe Thr Cys Arg Pro Glu Ser 

35 40 45 

Val Leu Gin Arg Asp Asp Pro Val Leu Glu Thr Ser His Gin Asn 
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50 1 55 60 



Phe 


Arg 


Arcr 


Phe 


Arcr 


Tyr 


Gin 


Glu 


Ala 


Ala 


Ser 


Pro Arg Glu Ala 






65 








70 








75 


Leu 


lie 


Arg 


Leu 


Arg 


Glu 


Leu 


Cys 


His 


Gin 


Trp 


Leu Arg 


Pro 


Glu 








80 








85 








90 


Arcr 


Arg 


Thr 


Lys 


Glu 


Gin 


He 


Leu 


Glu t 


Leu 


Leu 


Val Leu 


Glu 


Gin 




I 

\ 


95 










100 








105 


Phe 


Leu 


Thr 


Val 


Leu 


Pro 


Gly 


Glu 


Leu 


Gin 


Ser 


Trp Val 


Arg Gly 










110 








115 








120 


Gin 


Arcr 


Pro 


Glu 


Ser 


Gly 


Glu 


GlU 


Ala 


Val 


Thr 


Leu Val 


Glu 


Gly 








125 








130 








135 


Leu 


Gin 


Lvs 


Gin 


Pro 


Arcr 


Arcr 


Pro 


Arg 


Arg 


Trp 


Val Thr 


Val 


His 








140 










145 








150 


Val 


His 


Glv 


Gin 


Glu 


Val 


Leu 


Ser 


Glu- 


Glu 


Thr 


Val His 


Leu Gly 








155 










160 








165 


Ala 


Glu 


Pro 


Glu 


Ser 


Pro 


Asn 


Glu 


Leu 


Gin 


Asp 


Pro Val 


Gin 


Ser 










170 










175 








180 


Ser 


Thr 


Pro 


Glu 


Gin 


Ser 


Pro 


Glu 


GlU 


Thr 


Thr 


Gin Ser 


Pro 












185 










190- 








-L -7 — ' 


Leu 


Gly 


Ala 


Pro 


Ala 


Glu 


Gin 


Arg 


Pro 


His 


Gin 


Glu Glu 


Glu 










200 








205 








X VJ 


Gin 


Thr 


Leu 


Gin 


Glu 


Ser 


Glu 


Val 


Pro 


Val 


Pro 


Glu Asp 


Pro 


nop 










215 










220 








225 


Leu 


Pro 


Ala 


Glu 


Arcr 


Ser 


Ser 


Gly Asp 


Ser 


Glu 


Met Val Ala 


Leu 










230 










235 








240 


Leu 


Thr 


Ala 


Leu 


Ser 


Gin 


Glv 


Leu 


Val 


Thr 


Phe 


Lys Asp Val 


Ala 










245 








250 








255 


Val 


Cys 


Phe 


Ser 


Gin 


Asd 


Gin 


Trp 


Ser 


Asp 


Leu 


Asp Pro 


Thr 


Gin 








260 








265 








270 


Lys 


Glu 


Phe 


Tvx 


Glv 


Glu 


TVX 


Val 


Leu 


Glu 


Glu 


Arg Leu 


Trp 


Asn 






275 








280 








285 


Cys 


Cys 


Leu 


Ser 


Val 


His 


Ser 


Gin 


Ser 


Pro 


Arg 


Pro Asp 


Glu 


He 






290 










295 








300 


Leu 


Pro 




Leu 


Asp 


Glu 


Glu 


Glu 


Pro 


Gly Val 


Pro Asp 


He 


Gin 








305 










310 








315 


Glu 


Pro 


Gin 


GlU 


Thr 


Gin 


Glu 


Pro 


Glu 


He 


Leu 


Ser Phe 


Thr 


Tyr 










320 










325 








330 


Thr 


Gly 


Asp 


Arg 


Ser 


Lys 


Asp 


GlU GlU GlU 


Cys 


Leu Glu 


Gin 


Glu 








335 










340 








345 


Asp 


Leu 


Ser 


Leu 


Glu 


Asp 


He 


His 


Arg 


Pro 


Val 


Leu Gly Glu 


Pro 








350 








355 








360 


Glu 


lie 


His 


Gin 


Thr 


Pro 


Asp 


Trp 


Glu 


He 


Val 


Phe Glu Asp 


Asn 










3 65 






370 








375 


Pro 


Gly 


Arg 


Leu 


Asn 


Glu 


Arg 


Arg 


Phe 


Gly 


Tyr 









380 385 

<210> 308 
<211> 368 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: LI :331074 . 1 .orf 2 :2000MAY01 
<400> 308 



Met 


Cys 


Ser 


Arg 


Lys 


Lys 


Ala 


Glu 


Phe 


He 


Lys 


Gly 


Ser His 


Lys 


1 






5 








10 








15 


Cys 


Asn 


Val 


cys 


Ser 


Arg 


Thr 


Phe 


Phe 


Ser 


GlU 


Asn Gly Leu Arg 






20 








25 








30 


Glu 


His 


Leu 


Gin 


Thr 


His 


Arg 


Gly Pro 


Ala 


Lys 


His 


Tyr Met 


Cys 










35 










40 








45 


Pro 


He 


Cys 


Gly Glu 


Arg 


Phe 


Pro 


Ser 


Leu 


Leu 


Thr 


Leu Thr 


Glu 








50 








55 








60 


His 


Lys 


Val 


Thr 


His 


Ser 


Lys 


Ser 


Leu 


Asp 


Thr 


Gly 


Thr Cys 


Arg 








65 








70 








75 


He 


Cys 


Lys 


Met 


Pro 


Leu 


Gin 


Ser 


Glu 


Glu 


Glu 


Phe 


He Glu 


His 
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Cys Gin Met His Pro Asp 
95 

Cys Val Val Cys Met Gin 
110 

He His Gly Thr Phe His 
125 

Ala Ser Ser Pro Asn Gly 
140 

Ala Leu Cys Leu Lys Glu 
155 

Leu Asp Val Asn Gly Leu 
170 

Ala Arg Arg Ala Asn Gly 
185 

Pro Ala Asp Arg Pro Cys 
200 

Val Lys Phe Glu Ser Ala 
215 

Asp His Arg Asp Leu Thr 
230 



Thr 


Gin 


Thr Ser 


Pro 


Val 


Pro 


Arg 


Lys 


Lys 


Thr Tyr Gin 


Cys He 








245 




Glu 






250 




255 


Lys 


Cys 


Gin Met 


Thr 


Phe 


Asn 


Glu 


Arg 


Glu He Gin 


He His 




260 










265 




270 


Asp 


Ala 


Asn His 


Met 


He 


Glu 


Glu 


Gly 


He 


Asn His Glu 


Cys Lys 






275 








280 




285 


Leu 


Cys 


Asn Gin 


Met 


Phe 


Asp 


Ser 


Pro 


Ala 


Lys Leu Leu 


Cys His 






290 








295 


300 


Leu 


He 


Glu His 


Ser 


Phe 


Glu 


Gly 


Met 


Gly 


Gly Thr Phe 


Lys Cys 








305 










310 




■ 315 


Pro 


Val 


Cys Phe 


Thr 


Val 


Phe 


Val 


Gin 


Ala 


Asn Lys Leu 


Gin Gin 








320 










325 




330 


His 


He 


Phe Ala 


Val 


His 


Gly 


Gin 


Glu 


Asp 


Lys He Tyr 


Asp Cys 








335 










340 




345 


Ser 


Gin 


Cys Pro 


■Gin 


Lys 


Phe 


Phe 


Phe 


Gin 


Thr Glu Leu 


Gin Asn 






350 








355 




360 


His 


Thr 


Met Ser 


Gin 


His 


Ala 


Gin 











365 

<210> 309 
<211> 175 
<212> PRT 
<213> Homo .sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LI : 1170349 . 1 .orf 2 : 2000MAY01 
<400> 309 



Val 


Thr Phe Gin Asp 


Val Ala He Asp 


Phe 


Ser 


Lys Glu 


Glu 


Trp 


1 


5 


10 








15 


Gly 


Phe Leu Asn Pro 


Ala Gin Arg Asp 


Leu 


Tyr 


Thr Thr 


Val 


Met 




20 


25 








30 


Leu 


Glu Asn Tyr Gin 


Asn Leu Val Trp 


Leu 


Gly 


Leu Ser 


He 


Ser 




35 




40 








45 


Lys 


Ser Val He Ser 


Leu Leu Glu Lys 


Arg 


Lys 


Leu Pro 


Trp 


He 




50 




55 








60 


Met 


Ala Lys Glu Glu 


He Arg Gly Pro 


Leu 


Pro 


Asp Val 


Pro 


Gly 




65 




70 








75 


Ala 


Glu He Lys Glu 


Leu Ser Ala Lys 


Arg 


Ala 


He Asn 


Glu Val 




80 


85 








90 


Leu 


Ser Gin Phe Asp 


Thr Val He Lys 


Cys 


Thr 


Arg Asn 


Val 


Cys 




95 


100 








105 


Lys 


Glu Cys Gly Asn 


Leu Tyr Cys His 


Asn 


Met 


•Gin Leu 


Thr 


Leu 




110 


115 








120 


His 


Lys Arg Asn His 


Thr Gin Lys Lys 


Cys 


Asn Gin Cys 


Leu Asp 



Leu 


Arg 


Asn 


Ser 


Leu Thr Gly 


Phe Arg 








100 




105 


Thr 


Val 


Thr 


Ser 


Thr Leu Glu 


Leu Lys' 








115 




120 


Met 


Gin 


Lys 


Leu 


Ala Gly Ser 


Ser Ala 






130 




135 


Gin 


Gly 


Leu 


Gin 


Lys Leu Tyr 


Lys Cys 








145 




150 


Phe 


Arg 


Ser 


Lys 


Gin Asp Leu 


Val Lys 








J.OU 




lob 


Pro 


Tyr 


Gly 


Leu 


Cys Ala Gly 


Cys Met 








175 




180 


Gin 


Val 


Gly 


Gly 


Leu Ala Pro 


Pro Glu 






190 




195 


Ala 


Gly 


Leu 


Arg 


Cys Pro Glu 


Cys Ser 








205 




210 


Glu 


Asp 


Leu 


Glu 


Ser His Met- 


Gin Val 






220 




225 


Pro 


GlU 


Thr 


Ser 


Gly Pro Arg 


Lys Gly 
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125 130 135 

Cys Gly Lys Tyr Phe Thr Arg Gin Ser Pro Leu lie Gin His Gin 1 

140 145 150 

Arg lie His Thr Gly Glu Arg Pro Tyr Lys Cys Asn Glu Cys lie 

155 160 165 

Lys Thr Phe Asn Gin Arg Ala His Leu Thr 

170 175 

<210> 310 

<211> 78 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG : 33 5097 . 1 . orf 2 : 2000FEB18 
<400> 310 

Thr Gin Glu Val Glu Val Ala Val Ser Leu Asp Cys Cys His Cys 
1 , 5 10 15 

Thr Pro Ala Trp Val Thr Ala Glu Leu cys Leu Lys Lys Lys Met 

20 25 30 

Trp Asn Ser Phe Leu Gin Met Phe Ser Asn Ser He Pro Ser Ser 

35 40 45 

Val Cys Arg Tyr Met Tyr Ala He He Leu Gin Val He His Val 

50 55 60 

Asp Cys He Gly Asn Tyr Arg Lys Asp Tyr He Gly Leu Phe Arg 

65 , 70 75 

Lys Tyr Phe 

<210> 311 

<211> 61 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG : 1076451 . 1 . orf 1 : 2000FEB18 
<400> 311 

Glu Gin Ser Ser Val Gin Gly Arg Ser Val Glu Val Leu Thr Val 

15 10 15 

Gin Val Gin Met Leu Arg Asn Met Ser Pro Ala Met Ser Phe Leu 

20 25 30 

Met Leu Gin Pro Cys Val Asp Gin Ser Ala Ser Gly cys Asp Trp 

35 40 45 

Ser Arg Ala Cys Arg Leu Leu Gin Ser Tyr Phe Ala Gly Val Gly 

50 55 60 

Glu 



<210> 312 

<211> 73 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 805478 . 1 . orf 1 : 2000FEB01 
<400> 312 

Gly Leu Phe Gin Cys He His Gin Val Thr Glu Val Gly Gin Lys 
15 10 15 

Val Ala Thr Val Leu Leu Phe Tyr Gly Tyr Tyr Lys Cys Thr Gly 
20 25 30 

Thr Leu Lys He Thr Cys Leu Tyr Asn Val He Leu Tyr Lys Val 
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35 40 ^ 45 

Cys Ser Pro Gly Ser Asp Gin Pro Asp Val Cys Tyr Asp Pro Ser 
50 55 60 

Glu Leu Pro Met Thr Thr Val Phe Lys lie Arg Leu Arg 
65 70 

<210> 313 ( 

<211> 184 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> mis cofeature 

<223> Incyte ID No: LG: 101269 . 1 .orfl :2000MAY19 
<400> 313 



Cys 


Cys 


Ser 


Phe 


Lys 


Phe His Phe Asp 


Leu 


Ser 


Trp Glu He 


Leu 


1 








5 




10 






15 


Trp 


Pro 


He 


He 


Pro 


Trp Met Leu Lys 


Met 


Val 


Leu Thr Glu 


Asn 








20 




25 






. 30 


Pro 


Asn 


Gin 


Glu 


He 


Ala Thr Ser Leu 


Glu* 


Phe 


Leu Leu Leu 


Gin 










35 




40 






45 


Asn 


Ser 


Pro 


Gly 


Ser 


Leu Arg Ala' Gin 


Gin 


Arg 


Met Ser Tyr 


Tyr 










50 




55 






60 


Gly 


Ser 


Ser 


Tyr 


His 


He He Asn Ala 


Asp 


Ala 


Lys Tyr Pro 


Gly 










65 




70 






75 


Tyr 


Pro 


Pro 


Glu 


His 


He He Ala Glu 


Lys 


Arg 


Arg Ala Arg 


Arg 










80 




85 






90 


Arg 


Leu 


Leu 


His 


Lys 


Asp Gly Ser Cys 


Asn 


Val 


Tyr Phe Lys 


His 










95 




100 






105 


He 


Phe 


Gly 


Glu 


Trp 


Gly Ser Tyr Val 


Val 


Asp 


He Phe Thr 


Thr 










110 




115 






120 


Leu 


Val 


Asp 


Thr 


Lys 


Trp Arg His Met 


Phe 


Val 


He Phe Ser 


Leu 








125 


130 






135 


Ser 


Tyr 


He 


Leu 


Ser 


Trp Leu He Phe 


Gly 


Ser 


Val Phe Trp 


Leu 








140 


145 






150 


He 


Ala 


Phe 


His 


His 


Gly Asp Leu Leu 


Asn 


Asp 


Pro Asp He 


Thr 










i55 




160 






165 


Pro 


Cys 


Val 


Asp 


Asn 


Val His Ser .Phe 


Thr 


Gly 


Ala Phe Leu 


Phe 










170 




175 






180 


Ser 


Leu 


Glu 


Thr 















<210> 314 
<211> 219 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> mis c_feature 

<223> Incyte ID No: LI : 331087 . 1 .orf 2 : 2000MAY01 
<400> 314 



Leu 


Ser 


Gly His Val 


Gin 


Thr 


Leu Glu 


Ser 


Pro Pro Gin Cys 


Ser 


1 




5 








10 




15 


Pro 


Ala 


Pro Gly Gin 


Pro 


Asn 


Phe Cys 


Leu 


Leu Asp Gly Asp 


Gin 






20 








25 




30 


Val 


Ala 


Ala Ala Gly 


Ala 


Gly 


Ala Val 


Pro 


Ala Gly Val Glu 


Cys 






35 






40 




45 


Leu 


Gly 


Leu Leu Val 


Arg 


Gin 


Arg Gly 


Arg 


Gly Gin Lys Cys 


Leu 






50 








55 




60 


Pro 


Ser 


Leu Pro Gin 


Thr 


Gin 


Glu Ala 


Gly 


Pro Ala Ala Ala 


Leu 






65 








70 




75 


Arg 


Pro 


Arg Ser Thr 


Pro 


Cys 


Phe Val 


Tyr 


Gin Pro Ala He 


Arg 




80 








85 




90 


Glu 


Ala 


Asn Gly He 


Val 


Glu 


Cys Gly 


Pro 


Cys Gin Lys Val 


Phe 






95 






100 




105 
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Val 


Val 


Gin 


Gin 


He 


Pro Asn Ser Asn Leu 


Leu 


Leu Leu Val Thr 










HQ 


115 




120 


Asp 


Pro 


Thr 


Cys 


Asp 


Cys Ser lie Phe Pro 


Pro 


Val Leu Gin Glu 








125 


. 13° 




135 


Ala 


Thr 


Glu 


Val 


Lys 


Tyr Asn Ala Ser Val 


Lys 


Cys Asp Arg Met 










140 


' 145 




150 


Arg 


Ser 


Gin 


Lys 


Leu 


Arg Arg Arg Pro Asp 


Ser 


Cys His Ala Phe 








155 


160 




165 


His 


Pro 


Glu 


Val 


Arg 


Val Glu Ala Asp Arg 


Gly 


Trp Ala Gly Phe 










170 


175 




180 


Ser 


Ser 


Pro 


Asn 


Pro 


Leu Cys Leu Gly Leu 


Cys 


Pro' Cys Arg Gin 










185 


190 




195 


Glu 


His 


He 


Gly Met 


Pro Met Asn Thr Pro 


Val 


Pro Val Leu Leu 










200 


■ 205 




210 


Gly 


Gly 


Asn 


He 


Arg 


Val Tyr Ala Leu 














215 









<210> 315 
<211> 1603 
<212> PRT 

<213> Homo sapiens 

<220> ■ 
<221> misc_feature 

<223> Incyte ID No: LI : 41 0188 . 1 .orf 1 : 200 0MAY01 
<400> 315 



Ala 


Ala 


Glu 


Pro 


Ala 


Pro 


Ser 


Ala 


Pro 


Ser 


Pro 


Ala 


Pro 


Ser Arg 


1 








5 










10 








15 


Val 


Arg 


Ala 


Lys 


Gin 


Gin 


Pro 


Pro 


Thr 


Pro 


Gly Pro 


Gly Arg Gly 






20 










25 








30 


Thr 


Ser 


Ser 


Phe 


Pro 


Thr 


Gly Asn Val 


Arg 


Arg Ala 


Cys 


Ala Gin 










35 










40 








45 


Lys 


Gin 


Asp 


Glu 


Lys 


Met 


Ala 


Asn 


Phe 


Leu 


Leu 


Pro 


Arg Gly Thr 






50 










55 








60 


Ser 


Ser 


Phe 


Arg 


Arg 


Phe 


Thr Arg Glu 


Ser 


Leu 


Ala 


Ala 


He Glu 








65 










70 








75 


Lys 


Arg 


Met 


Ala 


Glu 


Lys 


Gin Ala Arg 


Gly 


Ser 


Thr 


Thr 


Leu Gin 






80 








85 








90 


Glu 


ser 


Arg 


Glu 


Gly 


Leu 


Pro 


Glu 


Glu 


Glu 


Ala 


Pro 


Arg 


Pro Gin 








95 










100 








105 


Leu 


Asp 


Leu 


Gin 


Ala 


Ser 


Lys 


Lys 


Leu 


Pro 


Asp Leu 


Tyr 


Gly Asn 








110 






115 








120 


Pro 


Pro 


Gin 


Glu 


Leu 


He 


Gly Glu Pro 


Leu 


Glu Asp 


Leu Asp Pro 










125 










130 








135 


Phe 


Tyr 


Ser 


Thr 


Gin 


Lys 


Thr 


Phe 


He 


Val 


Leu 


Asn 


Lys Gly Lys 








140 








145 








150 


Thr 


He 


Phe 


Arg 


Phe 


Ser 


Ala 


Thr 


Asn 


Ala 


Leu Tyr 


Val 


Leu Ser 








155 










160 








165 


Pro 


Phe 


His 


Pro 


Val 
170 


Arg 


Arg 


Ala 


Ala 


Val 
175 


Lys 


He 


Leu 


Val His 
180 


Ser 


Leu 


Phe 


Asn 


Met 


Leu 


He 


Met 


Cys 


Thr 


He 


Leu 


Thr 


Asn Cys 










185 








190 








195 


Val 


Phe 


Met 


Ala 


Gin 


His 


Asp 


Pro 


Pro 


Pro 


Trp 


Thr 


Lys 


Tyr Val 










200 








205 








210 


Glu 


Tyr 


Thr 


Phe 


Thr 


Ala 


He 


Tyr 


Thr 


Phe 


Glu 


Ser 


Leu 


Val Lys 








215 








220 








225 


He 


Leu 


Ala 


Arg 


Ala 


Phe 


.cys 


Leu 


His 


Ala 


Phe 


Thr 


Phe 


Leu Arg 








230 








235 








240 


Asp 


Pro 


Trp 


Asn 


Trp 


Leu 


Asp 


Phe 


Ser 


Val 


He 


He 


Met 


Ala Tyr 






245 








250 








255 


Thr 


Thr 


Glu 


Phe 


Val 


Asp 


Leu Gly Asn 


Val 


Ser 


Ala 


Leu 


Arg Thr 










260 








265 








270 


Phe 


Arg 


Val 


Leu Arg 


Ala 


Leu 


Lys 


Thr 


He 


Ser 


Val 


He 


Ser Gly 








275 








280 








285 


Leu 


Lys 


Thr 


He 


Val 


Gly 


Ala 


Leu 


He 


Gin 


Ser 


Val 


Lys 


Lys Leu 








290 








295 








300 
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Ala 


Asp 


Val 


Met 


Val 


Leu 


Thr 


Val 


Phe 


Cys 


Leu Ser 


Val Phe Ala 








305 










310 




315 


Leu 


lie 


Gly 


Leu 


Gin 


Leu 


Phe 


Met 


Gly Asn 


Leu Arg 


His Lys Cys 










320 










325 




330 ' 


Val 


Arg 


Asn 


Phe 


Thr 


Ala 


Leu 


Asn 


Gly Thr 


Asn Gly 


Ser Val Glu 










335 










340 




345 


Ala 


Asp 


Gly 


Leu 


Val 


Trp 


Glu 


Ser 


Leu Asp 


Leu Tyr 


Leu Ser Asp 










350 










355 




360 


Pro 


Glu 


Asn 


Tyr 


Leu 


Leu 


Lys 


Asn 


Gly Thr 


Ser Asp 


Val Leu Leu 










365 








370 




375 


Cys 


Gly 


Asn 


Ser 


Ser 


Asp 


Ala 


Gly 


Thr 


Cys 


Pro Glu 


Gly Tyr Arg 










380 










385 




390 


Cys 


Leu 


Lys 


Ala 


Gly 


Glu 


Asn 


Pro 


Asp His 


Gly Tyr 


Thr Ser Phe 








395 










400 




405 


Asp 


Ser 


Phe 


Ala 


Trp 


Ala 


Phe 


Leu 


Ala 


Leu 


Phe Arg 


Leu Met Thr 










410 










415 




420 


Gin 


Asp 


Cys 


Trp 


Glu 


Arg 


Leu 


Tyr 


Gin 


Gin 


Thr Leu 


Arg Ser Ala 










425 










430 




435 


Gly 


Lys 


He 


Tyr 


Met 


He 


Phe 


Phe 


Met 


Leu 


Val He 


Phe Leu Gly 




440 










445 




450 


Ser 


Phe 


Tyr 


Leu 


Val 


Asn 


Leu 


He 


Leu 


Ala 


Val Val 


Ala Met Ala 




• 




455 










460 




465 




GlU 


Glu 


Gin 


Asn 


Gin 


Ala 


Thr 


He 


Ala 


Glu Thr 


Glu Glu Lys 








470 










475 




480 


GlU 


Lys 


Arg 


Phe 


Gin 


Glu 


Ala 


Met 


Glu Met 


Leu Lys 


Lys Glu His 








•485 










490 




495 


GlU 


Ala 


Leu 


Thr 


He 


Arg 


Gly 


Val 


Asp 


Thr 


Val Ser 


Arg Ser Ser 










500 






505 




510 


Leu 


Glu 


Met 


Ser 


Pro 


Leu 


Ala 


Pro 


Val 


Asn 


Ser His 


Glu Arg Arg 










515 










520 




525 


'Ser 


Lys 


Arg 


Arg 


Lys 


Arg 


Met 


Ser 


Ser 


Gly 


Thr Glu 


Glu Cys Gly 










53 0 










535 




540 


Glu 


Asp 


Arg 


Leu 


Pro 


Lys 


Ser 


Asp 


Ser 


Glu 


Asp Gly 


Pro Arg Ala 










545 










550 




555 


Met 


Asn 


His 


Leu 


Ser 


Leu 


Thr 


Arg 


Gly 


Leu 


Ser Arg 


Thr Ser Met 










560 










565 




570 


Lys 


Pro 


Arg 


Ser 


Ser 


Arg 


Gly 


Ser 


He 


Phe 


Thr Phe 


Arg Arg Arg 










575 










580 




585 


Asp 


Leu 


Gly 


Ser 


Glu 


Ala 


Asp 


Phe 


Ala 


Asp 


Asp Glu 


Asn Ser Thr 










590 










595 




600 


Ala 


Arg 


Glu 


Ser 


Glu 


Ser 


His 


His 


Thr 


Ser 


Leu Leu 


Val Pro Trp 








605 










610 




615 


Pro 


Leu 


Arg 


Arg 


Thr 


Ser 


Ala 


Gin 


Gly 


Gin 


Pro Ser 


Pro Gly Thr 






620 








625 




630 


Ser 


Ala 


Pro 


Gly 


His 


Ala 


Leu 


His 


Gly 


Lys 


Lys Asn 


Ser Thr Val 








635 










640 




645 


Asp 


Cys 


Asn 


Gly 


Val 


Val 


Ser 


Leu 


Leu 


Gly 


Ala Gly 


Asp Pro Glu 










650 










655 


> 

Pro Val 


660 


Ala 


Thr 


Ser 


Pro 


Gly 


Ser 


His 


Leu 


Leu 


Arg 


Met Leu Glu 










665 










670 




675 


His 


Pro 


Pro 


Asp 


Thr 


Thr 


Thr 


Pro 


Ser 


Glu 


Glu Pro 


Gly Gly Pro 








680 










685 




690 


Gin 


Met 


Leu 


Thr 


Ser 


Gin 


Ala 


Pro 


Cys 


Val 


Asp Gly 


Phe Glu Glu 










695 










700 




705 


Pro 


Gly 


Ala 


Arg 


Gin 


Arg 


Ala 


Leu 


Ser 


Ala 


Val Ser 


Val Leu Thr 










710 










715 




720 


Ser 


Ala 


Leu 


Glu 


Glu 


Leu 


Glu 


Glu 


Ser 


Arg 


His Lys 


Cys Pro Pro 










725 










730 




735 


Cys 


Trp 


Asn 


Arg 


Leu 


Ala 


Gin 


Arg 


Tyr 


Leu 


He Trp 


Glu Cys Cys 










740 










745 




750 


Pro 


Leu 


Trp 


Met 


Ser 


He 


Lys 


Gin 


Gly 


Val 


Lys Leu 


Val Val Met 










755 








760 


765 


Asp 


Pro 


Phe 


Thr 


Asp 


Leu 


Thr 


He 


Thr 


Met 


Cys He 


Val Leu Asn 








770 










775 


780 


Thr 


Leu 


Phe 


Met 


Ala 


Leu 


Glu 


His 


Tyr 


Asn 


Met Thr 


Ser Glu Phe 










785 








790 




795 


Glu 


Glu 


Met 


Leu 


Gin 


Val 


Gly 


Asn 


Leu 


Val 


Phe Thr 


Gly He Phe 
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1 



WO 01/62927 







800 




Thr 


Ala 


Glu Met Thr 


Phe 






815 




Tyr 


Phe 


Gin Gin Gly 


Trp 






830 




Leu 


Ser 


Leu Met Glu 


Leu 






845 




Val 


Leu 


Arg Ser Phe 


Arg 






860 




Ser 


Trp 


Pro Thr Leu 


Asn 




875 




Val 


Gly 


Ala Leu Gly 


Asn 






890 




Phe 


lie 


Phe Ala Leu 


Val' 






905 




Arg 


Asn 


Asn Arg Lys 


Asn 






920 




Arg 


Trp 


His Met His 


Asp 






935 




Arg 


lie 


Leu Cys Gly 


Glu 






950 




Glu 


Val 


Gly Gin Lys 


Ser 






965 




Met 


Val 


Leu Gly Asn 


Leu 






980 




Leu 


Leu 


Asn Ser Phe 


Ser 






995 


* 


Asp 


Gly 


Glu Val Asn 


Asn 






1010 




Arg Gly 


Leu Arg Phe 


Val 






1025 




Gly Leu 


Leu Arg His 


Arg 






1040 




Gin Gly 


Gin Leu Pro 


Ser 






1055 




Pro 


Pro 


Glu Thr Glu 


Lys 






1070 




Phe 


Glu 


Glu Gly Glu 


Gin 






1085 




Glu 


Pro 


Val Cys Val 


Pro 






1100 




Asp 


Gin 


Glu Glu Asp 


Glu 




1115 




Ser 


Ser 


Lys Gin Gin 


GlU 






1130 




Gly Pro 


Pro Asp Ser 


Arg 






1145 




Ser 


Ser 


Glu Ala Glu 


Ala 






1160 




Gin 


Trp 


Lys Ala Glu 


Pro 






1175 




Glu 


Asp 


Ser Cys Ser 


Glu 






1190 




Ala 


Glu 


Leu Leu Glu 


Gin 






1205 




Asp 


Pro 


Glu Asp Cys 


Phe 






1220 




Cys 


Cys 


Ala Val Asp 


Thr 






1235 




Arg 


Leu 


Arg Lys Thr 


Cys 






1250 




Glu 


Thr 


Phe lie lie 


Phe 






1265 




Ala 


Phe 


i3lu Asp lie 


Tyr 






1280 




Leu 


Leu 


Glu Tyr Ala 


Asp 






1295 



805 



Lys 


He 


He Ala 


Leu Asp 




820 


Asn 


He 


Phe Asp 


Ser He 






835 




Gly 


Leu 


Ser Arg 


Met Ser 






850 




Leu 


Leu 


Arg Val 


Phe Lys 






865 




Thr 


Leu 


He Lys 


He He 






880 




Leu 


Thr 


Leu Val 


Leu Ala 






895 




Gly 


Lys 


Gin Leu 


Leu Gly 






910 




lie 


Ser 


Ala Pro 


His Glu 






925 




Phe 


Phe 


His Ser 


Phe Leu 






940 




Trp 


He 


Glu Asn 


Met Trp 






955 




lie 


Cys 


Leu He 


Leu Phe 




970 




Val 


Val 


Leu Asn 


Leu Phe 






985 




Ala 


Asp 


Asn Leu 


Thr Ala 




1000 




Leu 


Gin 


Val Ala 


Leu Ala 






1015 




Lys 


Arg 


Thr Thr 


Trp Asp 






1030 




Pro 


Gin 


Lys Pro 


Ala Ala 






1045 




Cys 


He 


Ala Thr 


Pro Tyr 






1060 




Val 


Pro 


Pro Thr 


Arg Lys 






1075 




Pro 


Gly 


Gin Gly 


Thr Pro 






1090 




lie 


Ala 


Val Ala 


Glu Ser 






1105 




Glu 


Asn 


Ser Leu 


Gly Thr 






1120 




Ser 


Gin 


Pro Val 


Ser Gly 






1135 




Thr 


Trp 


Ser Gin 


Val Ser 




1150 


1 


Ser 


Ala 


Ser Gin 


Ala Asp 






1165 


Gin 


Ala 


Pro Gly 


Cys Gly 






1180 




Gly 


Ser 


Thr Ala 


Asp Met 






1195 




lie 


Pro 


Asp Leu 


Gly Gin 






1210 




Thr 


GlU 


Gly Cys 


Val Arg 






1225 




Thr 


Gin 


Ala Pro 


Gly Lys 






1240 




Tyr 


His 


He Val 


Glu His 




1255 




Met 


He 


Leu Leu 


Ser Ser 






1270 




Leu 


•Glu 


Glu Arg 


Lys Thr 






1285 




Lys 


Met 


Phe Thr 


Tyr Val 



1300 
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' 810 


Pro 


Tyr Tyr. 




825 


He 


Val He 




840 


Asn 


Leu Ser 




855 


Leu 


Ala Lys 




870 


Gly Asn Ser 




885 


He 


He Val 




900 


Glu 


Asn Tyr 




915 


Asp , Trp Pro 




930 


He 


Val Phe 




945 


Ala 


Cys Met 




960 


Leu 


Thr Val 




975 


He 


Ala Leu 




990 


Pro 


GlU Asp 




1005 


Arg 


He Gin 




1020 


Phe Cys Cys 




1035 


Leu 


Ala Ala 




1050 


Ser 


Pro Pro 




1065 


Glu 


Thr Gin 




1080 


Gly Asp Pro 




1095 


Asp 


Thr Asp 




1110 


Glu 


GlU Glu 




1125 


Trp 


Pro Arg 




1140 


Ala 


Thr Ala 




1155 


Trp Arg Gin 




1170 


Glu 


Thr Pro 




1185 


Thr 


Asn Thr 




1200 


Asp Val Lys 




1215 


Arg 


Cys Pro 




1230 


Val 


Trp Trp 




1245 


Ser 


Trp Phe 




1260 


Gly Ala Leu 




1275 


He 


Lys Val 




1290 


Phe 


Val Leu 




1305 



BNSDOCID: <WO 0162927A2 I > 



WO 01/62927 jph PCT/US01/06059 

Glu Met Leu Leu^Pps Trp Val Ala Tyr Gly Phe Lys^^s Tyr Phe 

1310 1315 1320 

Thr Asn Ala Trp Cys Trp Leu Asp Phe Leu lie Val Asp Val Ser 

132S 1330 1335 

Leu Val Ser Leu Val Ala Asn Thr Leu Gly Phe Ala Glu Met Gly 

1340 1345 . 1350 

Pro lie Lys Ser Leu Arg Thr Leu Arg Ala Leu Arg Pro Leu Arg 

1355 1360 1365 

Ala Leu Ser Arg. Phe Glu Gly Met Arg Val Val Val Asn Ala Leu 

1370 " 1375 1380 

Val Gly Ala He Pro Ser He Met Asn Val Leu Leu Val Cys Leu 

1385 1390 1395 

He Phe Trp Leu He Phe Ser He Met Gly Val Asn' Leu Phe Ala . 

1400 1405 1410 

Gly Lys Phe Gly Arg Cys He Asn Tyr Thr Asp Gly Glu Phe Ser ■ 

1415 1420 1425 

Leu Val Pro Leu Ser He Val Asn Asn Lys Ser Asp Cys Lys He 

1430 1435 1440 

Gin Asn Ser Thr Gly Ser Phe Phe Trp Val Asn Val Lys Val Asn 

1445 1450 1455 ! 

Phe Asp Asn Val Ala Met Gly Tyr Leu Ala' Leu Leu Gin Val Ala 1 

1460 1465 1470 

Thr Phe Lys Gly Trp Met Asp He' Met Tyr Ala Ala Val Asp Ser 

1475 1480 1485 

Arg Glu Val Asn Met Gin Pro Lys Trp Glu Asp Asn Val Tyr Met 

1490 1495 1500 

Tyr Leu Tyr Phe Val He Phe He He Phe Gly Gly Phe Phe Thr 1 i 

1505 1510 1515 

Leu Asn Leu Phe Val Gly Val He He Asp Asn Phe Asn Gin Gin 

1520 1525 1530 

Lys Lys Lys Leu Gly Gly Gin Asp He Phe Met Thr Glu Glu Gin 

1535 1540 1545 

Lys Lys Tyr Tyr Asn Ala Met Lys Lys Leu Gly Ser Lys Lys Pro 

1550 1555 1560 

Gin Lys Pro He Pro Arg Pro Leu Asn Lys Phe Gin Gly Phe Val 

1565 1570 1575 

Phe Asp He Val Thr Arg Gin Ala Phe Asp He Thr He Met Val 

<1580 1585 1590 

Leu He Cys Leu Asn Met Val His , His Asp Gly Gly Asp 

1595 1600 

<210> 316 
<211> 200 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI:1188288. l.orf3 :2000MAY01 
<400> 316 

Gly Gly Ala Gly Val Arg Gly Ala Gly Trp Arg Leu Gin Gin Arg 
15 10 15 

Gly Arg Gly Ala Arg Ala Pro Pro Ala Arg Gin Ala Arg Gin Gly 

20 25 30 

Gly Pro Arg Ala Arg Pro Leu Glu Gin Gin Gly Gly Val Arg Ala 

35 40 45 

Glu Arg Gly Arg Gly Asp His Trp Ala Trp Ala Thr Cys Gly Ala 
' 50 55 60 

Ser Pro Thr Leu Cys Tyr Lys Asn Gly Gly Gly Ala Phe Leu He 

65 70 75 

Pro Tyr Val Val Phe Phe He Cys Cys Gly He Pro Val Phe Phe 

80 85 90 

Leu Glu Thr Ala Leu Gly Gin Phe Thr Ser Glu Gly Gly He Thr 

95 ~ 100 105 

Cys Trp Arg Lys Val Cys Pro Leu Phe Glu Gly He Gly Tyr Ala 
110 115 120 



166/228 



BNSDOCID: <WO 0162927A2J_> 




WO 01/62927 , PCT/US01/06059 



Thr 


Gin 


Val 


lie 


Glu 


Ala His 


Leu 


Asn 


Val Tyr 


Tyr lie lie 


He 










125 








130 




135 


Leu 


Ala 


Trp 


Ala 


lie 


Phe Tyr 


Leu 


Ser 


Asn Cys 


Phe Thr Thr 


Glu 








140 






145 




150 


Leu 


Pro 


Trp 


Ala 


Thr 


Cys Gly His Glu Trp Asn Thr Glu Asn Cys 








155 








160 




165 


Val Glu 


Phe 


Gin 


Lys 


Leu Asn 


Val 


Ser 


Asn Tyr 


Ser His Val 


Ser 










170 








175 




180 


Leu 


Gin 


Lys 


Cys 


His 


Leu Pro 


Cys 


His 


Gly Val 


Leu Gly Ala 


Pro 








185 






190 




195 



Gly Pro Gly His Leu 
200 

<210> 317 
<211> 329 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: LI : 427997 . 4 . orf 3 : 2000MAY01 
<400> 317 



Arg 


Thr 


Gly Gly Gly 


v? j.y 


Gly Ser Gin 


Ser 


Pro 


Ala 


Pro 


Asp 


Gly 










5 




















15 


GlU 


Arg 


Thr 


Met 


His 


Cys 


Leu Gly Ala 


GlU 


Tyr 


Leu 


Val 


Ser 


Ala 










20 




















30 


Asp 




Ala 


Pro 


Arg 


m n 

V71X1 


Arg Glu Trp 


ax y 


Pro 


Gin 


He 


Tyr 


Arg 










35 




















45 


TiVs 

J-ljr O 


Cys 


Tnr 


Asp 


Thr 


Ala 


Trp 


Leu 


Phe 


Leu 


Phe 


Phe 


Leu 


Phe 


Trp 








50 










55 










60 


-L Hi. 




Leu 


Val 


Phe 


Tie 

IXC 


Met Gly Tyr 


Ser 


Val 


Val 


Ala Gly Ala 








65 










70 










75 


Ala 


Glv 


Arg 


Leu 


Leu 


Phe 


Gly Tyr Asp 


Ser 


Phe Gly Asn Met 


Cys 










80 










85 










90 


GlV 


Lys 


Lys 


Asn 


Ser 


Pro 


Val 


Glu Gly 


Ala 


Pro 


Leu 


Ser Gly 


Gin 






95 










100 










105 


Asp 


Met 


Thr 


Leu 


Lys 


Lys 


His 


Val 


Phe 


Phe 


Met 


Asn 


Ser 


Cys 


Asn 








110 








115 










120 


Leu 


GlU 


Val 


Lys 


Gly 


Thr 


Gin 


Leu 


Asn 


Arg 


Met 


Ala 


Leu 


Cys 


Val 








125 










130 










135 


Ser 


Asn 


Cys 


Pro 


Glu 


Glu 


Gin Leu Asp 


Ser 


Leu 


Glu 


Glu 


Val 


Gin 








140 










145 










150 


Phe 


Phe 


Ala 


Asn 


Thr 


Ser 


Gly 


Ser 


Phe 


Leu 


Cys 


Gly Tyr 


Ser 


Leu 










155 








160 










165 


Asn 


Ser 


Phe 


Asn 


Tyr 


Thr 


His 


Ser 


Pro 


Lys 


Ala Asp 


Ser 


Leu 


Cys 










170 










175 










180 


Pro 


Arg 


Leu 


Pro 


Val 


Pro 


Pro 


Ser 


Lys 


Ser 


Phe 


Pro 


Leu 


Phe 


Asn 








185 










190 










195 


Arg 


Cys 


Val 


Pro 


Gin 
200 


Thr 


Pro 


Glu 


Cys 


Tyr 
205 


Ser 


Leu 


Phe 


Ala 


Ser 
210 


Val 


Leu 


He 


Asn 


Asp 


Val 


Asp 


Thr 


Leu 


His 


Arg 


He 


Leu 


Ser Gly 










215 










220 










225 


He 


Met 


Ser 


Gly Arg 


Asp 


Thr 


He 


Leu 


Gly 


Leu 


Cys 


He 


Leu 


Ala 










230 










235 










240 


Leu 


Ala 


Leu 


Ser 


Leu 
245 


Ala 


Met 


Met 


Leu 


Thr 
250 


Val 


Gin 


He 


His 


Thr 
255 


Pro 


Pro 


Phe 


Trp 


Phe 


Thr 


Phe 


Ser 


Phe 


His 


Trp 


Leu 


Phe 


Trp 


Asp 








260 










265 










270 


Cys 


Cys 


Leu 


Val 


Cys 


Gly 


Val 


Leu 


Trp 


Trp 


Leu 


Tyr 


Tyr Asp 


Tyr 










275 










280 










285 


Thr 


Asn 


Asp 


Leu 


Ser 


He 


Glu Leu Asp 


Thr 


Glu Gin Gly Lys 


Tyr 








290 










295 










300 


GlU 


Val 


Arg Ala Gly 


Val 


Cys 


Tyr 


Arg 


Asn 


Pro 


Gin Gly 


He 


Thr 










305 










310 










315 


Ala 


Asp 


Ala Ala Arg 


Leu 


Asp 


He 


Leu 


Phe 


Ser Glu Arg Glu 










320 








325 
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<210> 318 
<211> 256 
<212> PRT 

<213> Homo sapiens ■ 
<220> 

<221> mis cofeature 

<223> Incyte.IDNo: LG:451682 . 1 .orf 3 : 2000FEB18 
<400> 318 

His Leu Ala Ala Ala Ala Met Ser Arg Arg Tyr Asp Ser Arg Thr 
1 5 10 , 15 

Thr He Phe Ser Pro Glu Gly Arg Leu Tyr Gin Val Glu Tyr Ala 
20 25 30 

Met Glu Ala He Gly Asn Ala Gly Ser Ala Leu Gly lie Leu Ala 
35 40 45 

Ala Asp Gly Val Val Leu Val Gly Glu Lys Lys Val Thr Ser Lys 
50 55 60 

Leu Leu Gin Thr Ser Arg Ser Ala Glu Lys Met Tyr Lys He Asp 
65 70 .75 

Ser His Leu Ala Cys Ala Val Ala Gly He' Met Ser Asp Ala Asn 
80 85 * . 90 

He Leu He Asn Thr Ala Arg Leu His Ala Gin Arg Tyr Ala Leu 
?5 100 105 

Ser Tyr Gin Glu Pro He Pro Val Glu Gin Leu Val Gin Ser Leu 
110 115 120 

Cys Asp Thr Lys Gin Gly Tyr Thr Gin Phe Gly Gly . Leu Arg Pro 
125 130 135 

Phe Gly Val Ser Phe Leu Phe Ala Gly Trp Asp Lys His His Gly 
140 145 150 

Phe Gin Leu Tyr Met Ser Asp Pro Ser Gly Asn Tyr Gly Gly Trp 
155 160 165 

Lys Ala Ala Ala Val Gly Ala Asn Ser Gin Ala Ala Gin Ser Met 
170 175 180 

Leu Lys Gin Asp Tyr Lys Asp Ala Leu Thr Arg Glu Glu Ala Val 
185 190 195 

Gly Leu Ala Leu Lys Val Leu Ser Lys Thr Met Asp Ser Thr Ser 
1 200 205 210 

Leu Thr Ala Glu Lys Leu Glu Leu . Ala Glu Val Phe Leu Gin Pro 
215 220 225 

Asp Thr Gly Glu Val Gin Tyr Gin Val Cys Ser Pro Glu Ala Leu 
230 235 240 

Gly Lys Leu Leu Ala Asn Ser Gly Leu Thr Gin Pro Thr Pro Glu 
245 250 255 

Ala 



<210> 319 

<211> 76 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 1077283 . 1 .orf 2 :2000FEB18 
<400> 319 

Ala Val Ser Phe Arg Arg Leu Leu Gin Thr Trp Ser Thr Pro Pro 
15 10 15 

Cys Ser Ser Thr Ser Arg Leu Met Ala Ser Leu Trp Val Ala Val 

20 25 30 

Trp Leu Arg Lys Trp Leu Ala Asp Lys Val Pro Lys Thr Ala Glu 

35 40 45 

Asn Phe Arg Ala Leu Ser Thr Gly Glu Lys Gly Phe Gly Tyr Asn 

50 55 60 

, Gly Phe Leu Leu Ser Gin Asn Tyr Ser Arg He His Val Pro Gly 

65 70 75 
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<210> 320 
<211> 276 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No; LG: 481436 . 5 . orf 3 : 2000FEB18 ' 

<400> 320 i 
Thr Asn lie Lys lie Thr Met Lys Val' Leu Gly Val Thr Lys Asp 
1 5 10 15 

Ser Gly Asp Glu Asp Leu Lys Lys Ala Tyr Arg Lys Leu Ala Leu 
20 25 30 

Lys Phe His Pro Asp Lys Asn His Ala Pro Gly Ala Thr Asp Ala 

35 40' 45 1 

Phe Lys Lys lie Gly Asn Ala Tyr Ala Val Leu Ser Asn Pro Glu 
50 55 60 

Lys Arg Lys Gin Tyr Asp Leu Thr Gly Asn Glu Glu Gin Ala Cys 
65 70 75 

Asn His Gin Asn Asn Gly Arg Phe Asn Phe His Arg Gly Cys Glu 

80 85 90 i 

Ala Asp lie Thr Pro Glu Asp Leu Phe Asn lie Phe Phe Gly Gly 
95 100 105 

Gly Phe Pro Ser Gly Ser Val His Ser Phe Ser Asn Gly Arg Ala 

110 115 120 

Gly Tyr Ser Gin Gin His Gin His Arg His Ser Gly His Glu Arg 

125 130 135 

Glu Glu Glu Arg Gly Asp Gly Gly Phe Ser Val Phe lie Gin Leu 

140 145 150 

Met Pro lie lie Val Leu lie Leu Val Ser Leu Leu Ser Gin Leu 

155 160 165 

Met Val Ser Asn Pro Pro Tyr Ser Leu Tyr Pro Arg Ser Gly Thr 

170 175 180 

Gly Gin Thr lie Lys Met Gin Thr Glu Asn Leu Gly Val Val Tyr 

185 190 195 

Tyr Val Asn Lys Asp Phe Lys Asn Glu Tyr Lys Gly Met Leu Leu 

200 205 210 

Gin Lys Val Glu Lys Ser Val Glu Glu Asp Tyr Val Thr Asn lie 

215 220 225 

Arg Asn Asn Cys Trp Lys Glu Arg Gin Gin Lys Thr Asp Met Gin 

230 235 240 

Tyr Ala Ala Lys Val Tyr Arg Asp Asp Arg Leu Arg Arg Lys Ala 

245 250 255 

Asp Ala Leu Ser Met Asp Asn Cys Lys Glu Leu Glu Arg Leu Thr 

260 265 270 

Ser Leu Tyr Lys Gly Gly 

275 

<210> 321 
<211> 115 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> mis cofeature 

<223> Incyte ID No: LI :793701 . 1 .orf 1 :2000FEB01 
<400> 321 

Gin Ala Leu Leu Gin Ser His Pro Glu Ala Asp Trp Ser Thr His 
15 10 15 

Ser Arg Ser Met Arg Lys Leu lie Val Arg Phe lie Phe Leu Lys 
20 25 30 

Phe Trp Thr Tyr Thr Val Arg Ala Ser Thr Asp Leu Thr <3ln Thr 
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^35 








40 


45 


Gly Asp Cys Ser 


Gin 


Cys 


lie His 


Gin 


Val 


Thr Glu Val Gly Gin 




50 








55 


60, 


Gin lie Lys Thr 


lie 


Phe 


Leu Phe Tyr 


Ser 


Tyr Tyr Glu Cys Val" 




65 








. 70 


75 


Glu Thr lie Lys 


Lys 


Leu 


Val Cys 


lie 


Met 


Pro Leu Ser Thr Arg 




80 








85 


90 


Tyr Val Ala Arg 


Glu 


Met 


Thr Asp 


Leu 


Met 


Arg Val He Thr His 




95 








100 


105 


Leu Ser Pro Pro 


Gin 


Pro 


Pro Phe 


Leu 


Lys 






110 








115 





<210> 322 
<211> 227 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 373 637 . 1 .orf 3 :200 0FEB01 
<400> 322 



His 


Pro 


Ala 


Trp Trp 


Thr Thr Thr Arg Cys 


Trp 


Thr Cys Pro Gly 


1 






5 




10 




15 


Arg 


Pro 


His 


Pro Arg 


Pro Ser Arg Arg Arg 


Thr 


Ala Ser Trp Arg 








20 




25 




30 


Ser 


Ser 


Gly 


Thr Pro 


Thr Lys Thr Leu Arg 


Thr 


Arg Arg Lys Arg 








35 




40 




45 


Arg 


Gly 


Asp 


Ser Ser 


Arg Trp Pro Arg 


Pro 


Thr 


Arg Cys Cys Arg 








50 




55 




60 


Thr 


Pro 


Arg 


Asn Ala 


He Ser Met Thr 


Ala 


Met 


Ala Arg Arg Gly 








65 




70 




75 


Arg 


Arg 


Ala 


Ala Ala 


Gin Ala Ala Gly 


Pro 


Ser 


Arg Thr Pro Ser 








80 




85 




90 


Ser 


Thr 


Ser 


Ser Ala 


Ser Ala Thr Gin 


Pro 


Thr 


Ser Ser Gly Ser 








95 




100 




105 


Ser 


Ser 


Ala 


Ala Arg 


Thr His Ser Pro 


Leu 


Thr 


Ser Trp Glu Thr 








110 




115 




120 


Arg 


Trp 


Arg 


He Phe 


Trp Gly Gly Gin 


Arg 


Asn 


Cys Trp Gly Ser 








125 




13 0 




135 


Arg 


Ser 


Arg 


Ala Ser 


Ala Pro Leu Phe 


Ser 


Ala 


Phe Ser Glu Phe 






140 




145 




150 


Pro 


Ala 


Phe 


Gly Gly 


Val Phe Ser Ser 


Phe 


Asp 


Thr Gly Phe Arg 








155 




160 




165 


Ser 


Phe 


Gly 


Ser Leu 


Gly Ser Gly Gly 


Leu 


Ser 


Ser Phe Cys Met 








170 




175 




180 


Ser 


Tyr 


Gly 


Ser Asp 


Gly Thr Gly Ser 


Phe 


Lys 


Ser Met Ser Thr 








185 




190 




195 


Ser 


Thr 


Glu 


He Val 


Asp Gly Lys Lys 


He 


Thr 


Thr Lys Arg He 








200 




205 




210 


He 


Glu 


Asn 


Gly Gin 


Glu Arg Val Glu 


Val 


Glu 


Glu Asp Gly Glu 








215 




220 




225 


Leu 


Ser 















<210> 323 
<211> 100 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 239368 . 2 .orf 3 : 2000MAY19 
<400> 323 

Glu Glu Ser Val Leu Arg Gly Lys Phe Leu Phe Thr Ser Gly He 
15 10 15 



170/228 



BNSDOCID: <WO 01 62927 A2_L> 



WO 01/62927 

Pro Arg Ala Ser Trp 
20 

Ser Pro Gly Ser Ala 
35 

Gly Leu Gly Ala Arg 
50 

Phe . Leu His Leu Leu 
65 

Pro Ala Asp Ala Ser 
80 

Ala Ser Pro Thr Pro 
95 

<210> 324 
<211> 142 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> Incyte ID No: LI : 053826 . 1 . orf 3 :2000MAY01 

<400> 324 



Asp 


Phe 


Trp Ala 


Lys 


lie Tyr 


Leu 


Tyr 


Ala 


Leu 


Glu Gly Arg Lys 


1 






5 








10 






15 


Tyr 


Arg 


Ser He 


Leu 


Gin Leu 


Val 


Lys 


Pro 


Trp 


Tyr Asp Glu 


Val 






20 








25 






30 


Lys 


Asp 


Tyr Ala 


Phe 
35 


Pro Tyr 


Pro 


Gin 


Asp 
40 


Cys 


Asn Pro Arg 


Cys 
45 


Pro 


Met 


Arg Cys 


Phe 


Gly Pro Met 


Cys 


Thr 


His 


Tyr Thr Gin 


Met 






50 








55 






60 


Val 


Trp 


Ala Thr 


Ser 
65 


Asn Arg 


He 


Gly 


Cys 
70 


Ala 


He His Thr 


Cys 
75 


Gin 


Asn 


Met Asn 


Val 


Trp Gly Ser Val 


Trp 


Arg 


Arg Ala Val 


Tyr 








80 








85 






90 


Leu 


Val 


Cys Asn 


Tyr 


Ala Pro 


Lys 


Gly 


Asn 


Trp 


He Gly Glu Ala 








95 








100 






105 


Pro 


Tyr 


Lys Val 


Gly 
110 


Val Pro 


Cys 


Ser 


Ser 
115 


Cys 


Pro Pro Ser 


Tyr 
120 


Gly 


Gly 


Ser Cys 


Thr 
125 


Asp Asn 


Leu 


Cys 


Phe 
130 


Pro 


Gly Val Thr 


Ser 
135 


Asn 


Tyr 


Leu Tyr 


Trp 
140 


Phe Lys 















<210> 325 
<211> 263 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: LI :449393 . 1 .orf 3 :2000MAY01 
<400> 325 



Ala 


Met 


Ser 


Leu 


Arg 


Val 


Leu 


Asn 


Pro 


Asn 


Ala 


Glu Val Leu Asn 


1 








5 










10 




15 


Lys 


Ser 


Ala 


Ala 


Leu 


His 


Met 


Asn 


He 


Asn 


Ala 


Ala Lys Gly Leu 








20 










25 




30 


Gin 


Asp 


Val 


Leu 


Lys 


Thr 


Asn 


Leu Gly 


Pro 


Lys 


Gly Thr He Lys 










35 










40 




45 


Met 


Leu 


Val 


Gly 


Gly 


Ala 


Gly 


Asp 


Leu 


Lys 


Leu 


Thr Lys Asp Gly 










50 








55 




60 


Asn 


Thr 


Leu 


Leu 


Lys 


Glu 


Met 


Gin 


He 


Gin 


Asn 


Pro Thr Ala He 










65 










70 




75 


Met 


He 


Ala 


Arg 


Thr 


Ala 


Val 


Ala 


Gin 


Asp 


Asp 


Thr Ser Gly Asp 










80 










85 




90 


Gly 


Thr 


Thr 


Ser 


Thr 


Val 


Leu 


Phe 


He 


Gly 


Glu 


Leu Met Lys Gin 



171/228 



PCT/US01/060S9 



VaT 


Asp 


Ser 


Gly 


Leu 


His Thr 


Gin 


Pro^Gly 






25 






30 


Ser 


Val 


Pro 


Pro 


Leu 


Ser Gly Pro <31y Cys 










• 40 






45 


Pro 


Ser 


Leu 


Ala 


Pro 
55 


Gly Asn 


Ser 


Asp Val 
60 


Pro 


Leu 


Leu 


Arg 


Gly 
70 


Pro Lys 


Pro 


Gly Lys 
75 


His 


Pro 


Glu 


Asn 


Cys 
85 


Glu Gin 


Thr 


His Arg 
90 


Glu 


Ser 


Ser 


Cys 


Cys 
100 









BNSPOCIP: <WO_0162927A2J_> 



WO 01/62927 PCT/US01/06059 



55 100 105 

Ser Glu Arg Cys He Asp Glu' Gly Thr His Pro Arg Phe Leu Val 

110 ' 115 120 

Asp Gly Phe Asp Val, Ala Lys Arg Ala Cys Leu Asp Phe Leu Asp 

125 130 135 

Lys Phe Lys Thr Pro Val Val Thr Gly Glu Glu Pro Asp Arg Asp 

140 145 150 

Thr Leu Lys Met Val Ala Arg Thr Thr Leu Arg Thr Lys Leu Tyr 

' 155 160 165 

Glu Gly Leu Ala . Asp Gin Leu Thr Asp He Val Val Asn Ala Val 

170 175 180 

Leu Crys He Arg Lys Pro Asp Glu Pro He Asp Leu Phe Met Val 

185 190 ' 195 

Glu He Met His Met Arg His Lys Phe Asp Val Asp Thr Arg Leu 

200 205 210 

Val Glu Gly Leu Val Leu Asp His Gly ■ Ser Arg His Pro Asp Met 

. 215 220 225 

Lys Arg Arg Ala Glu Asn Cys Tyr He Leu Thr Cys Asn Val Ser 

230 235 240 

Leu Glu Tyr Glu Lys Ser Glu He Asn Ala Gly Phe Phe Tyr Ser 

245 250- 255 
Asn Ala Glu Gin Lys Lys Lys Lys 
260. 

<210> 326 ' 

<211> 357 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 1071427 . 96 . orf 2 :2000MAY01 
<400> 326 

Asp Glu Thr Arg Glu Asp Ala Met Ala Met Val Asp His Cys Leu 
15 10 15 

Lys Lys Ala Leu Trp Phe Gin Gly Arg Cys Val Lys Val Asp Leu 
» ,'20 25 30 

Ser Glu Lys Tyr Lys Lys Leu Val Leu Arg He Pro Asn Arg Gly 
35 40 ' 45 

He Asp Leu Leu Lys Lys Asp Lys Ser Arg Lys Arg Ser Tyr Ser 
50 55 60 

Pro Asp Gly Lys Glu Ser Pro Ser Asp Lys Lys Ser Lys Thr Asp 
65 70 75 

Gly Ser Gin Lys Thr Glu Ser Ser Thr Glu Gly Lys Glu Gin Glu 
80 85 90 

Glu Lys Ser Gly Glu Asp Gly Glu Lys Asp Thr Lys Asp Asp Gin 
95 100 105 

Thr Glu Gin Glu Pro Asn Met Leu Leu Glu Ser Glu Asp Glu Leu 
110 115 120 

Leu Val Asp Glu Glu Glu Ala Ala Ala Leu Leu Glu Ser Gly Ser 
125 130 135 

Ser Val Gly Asp Glu Thr Asp Leu Ala Asn Leu Gly Asp Val Ala 
140 145 150 

Ser Asp Gly Lys Lys Glu Pro Ser Asp Lys Ala Val Lys Lys Asp 
155 160 165 

Gly Ser Ala Ser Ala Ala Ala Lys Lys Lys Leu Lys Lys Arg Arg 
170 175 180 

Phe Pro Gly Ser Met Glu Gly Phe Val Thr Leu Asp Glu Val Gly 
185 190 195 

Asp Glu Glu Asp Ser Glu Leu Gin Lys Leu Arg Lys Ser Gly Met 
200 205 210 

Ala Phe Lys Ser Gly Asp Lys Asn Asp Asp Gly Leu Val Glu He 
215 220 225 

Lys Val Asp Lys He Glu Glu Leu Asp Gin Glu Asn Glu Ala Ala 
230 235 240 

Leu Glu Asn Gly He Lys Asn Glu Glu Asn Thr Glu Pro Gly Ala 
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245 250 ^^55 



Glu 


Ser 


Ser 


Glu Asn 


Ala Asp Asp Pro 

• 


Asn 


Lys 


Asp 


Thr 


Ser 


Glu 








260' 


265 










270 


Asn 


Ala 


Asp 


Gly Gin 


Ser Asp Glu Asn 


Lys 


Asp 


Asp 


Tyr 


Thr 


He 






275 


280 










285 


Pro 


Asp 


Glu 


Tyr Arg 


He Gly Pro Tyr 


Gin 


Pro 


Asn 


Val 


Pro 


Val 










O Q K 

a y o 












Gly 


He 


Asp 


Tyr Val 


He Pro Lys Thr 


Gly 


Phe 


Tyr 


Cys 


Lys 


Leu 






305 




310 










315 


Cys 


Ser 


Leu 


Phe Tyr 


Thr Asn Glu Glu 


Val 


Ala 


Lys 


Asn 


Thr 


His 






320 




325 










330 


Cys 


Ser 


Ser 


Leu Pro 


His Tyr Gin Lys 


Leu 


Lys 


Lys 


Phe 


Leu 


Asn 






335 


340 










345 


Lys 


Leu 


Ala 


Glu Glu 


Arg Arg Gin Lys 


, Lys 


Glu 


Thr 









350 355 

<210> 327 
<211> 100 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: LI :33 6338 . 8 *orf 2 : 2000MAY61 
<220> 

<221> unsure 
<222> 10/ 18, 30 
<223> unknown or other 

<400> 327 



Met 


He 


Ser 


Ser 


Asn 


Ser 


Pro Asn 


Leu 


Xaa 


Leu Trp 


Pro 


He 


Thr 


1 








5 








10 








15 


Thr 


Phe 


Xaa 


His 


Val 
20 


Cys 


Thr Ser 


Cys 


Cys 
25 


Ser Arg 


Leu 


Gin 


Xaa 
30 


Pro 


Phe 


Ser 


Leu, 


Ala 


Asp 


Phe Trp 


Lys 


Ser 


Asn Gly Arg Val 


Leu 








35 








40 








45 


Gly 


Gly 


Arg 


Arg 


Leu Leu 


Tyr Ala Cys 


GlU 


Lys Glu 


Gin 


Ser 


Val 








50 








55 








60 


Pro 


Thr 


Glu 


Gly 


Ser 


Ser 


Thr Thr 


Leu 


Leu 


Gin Asn 


Met 


Tyr 


He 








65 








70 








75 


Ser 


Arg 


Leu 


Ser 


Ser 


His 


Leu Arg 


Phe 


Leu 


Cys Ser 


Cys 


Arg 


Leu 








80 








85 








90 


He 


Asp 


Tyr 


Ser 


He 
95 


Leu 


Leu Lys 


Arg 


Lys 
100 











<210> 328 
<211> 303 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG : 345527 . 1 . orf 2 : 2000FEB18 
<400> 328 



Arg 


Glu 


Leu 


Lys 


Arg 


Phe 


Asn 


Ala Asp Asn 


Lys 


Leu Leu Leu 


Thr 


1 






5 






10 






15 


Gly 


Thr 


Pro 


Leu 


Gin 


Asn 


Asn 


Leu Ser Glu 


Leu 


Trp Ser Leu 


Leu 








20 






25 






30 


Asn 


Phe 


Leu 


Leu 


Pro 


Asp 


Val 


Phe Asp Asp 


Leu 


Lys Ser Phe 


Glu 










35 






40 






45 


Ser 


Trp 


Phe 


Asp 


He 


Thr 


Ser 


Leu Ser Glu 


Thr 


Ala Glu Asp 


He 








50 






55 






60 


He 


Ala 


Lys 


Glu 


Arg 


Glu 


Gin 


Asn Val Leu 


His 


Met Leu His 


Gin 








65 






70 






75 


He 


Leu 


Thr 


Pro 


Phe 


Leu 


Leu 


Arg Arg Leu 


Lys 


Ser Asp Val Ala 
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80 




Leu 


Glu 


Val 


Pro 


Pro 
95 


J— i_y s y 


S ex- 


Lys 


Lys 


Gin 


Glu 


lie Phe 








110 




ile 


Ala 


Asn 


Met 


.Phe 


Gly Ser 










125 


Ser 


Pro 


Thr 


Glv 


Arcr 

ax y 

140 


Pro Lys 


r T\rr~ 
A Jf A 


Ser 


Lys 


lie 


Asp 
155 


Acn Phe 


Ser 


Gin 


lie 


Gin 


Pro 
170 


Glu Val 


vai 


A CT1 


Tl e 

a Ats 


pr-A 


Val 

VAX 

185 


fil 11 Cpr 

OIU OCX 


lie 

A A G 


Met 


Met 


Leu 


Leu 


•fii. y jl 










200 


Vjf A U. 


■Or* 


-t A U 


lie 


A QT> 

215 


Pro Val 


CSl 11 


T .oil 
J_ifc= IA 


Val 

V GLA 


Allx 


Z. -J u 


CJoy* fAlvr 

Oci Uljf 


Leu 


Pro 


GlU 


Leu 


Lys 
245 


Lys Arg 


Gin 


Met 


Thr 


Ser 


Met 
260 


Leu Asp 


Arg 


Asp 


Phe 


Asn 


Phe 
275 


Ser Arg 


Arg 


Glu 


Lys 


Lys 


Thr 
290 


Cys Thr 


Leu 


Ser 


Ser 













85 




90 


Glu 


Val 


Val 


Val 


Tvr Ala Pro Leu 






100 




105, 


Tyr 


Thr 


Ala 


He 


Val Asn Arg Thr 




115 




120 


ser 


GlU 


Lvs 


Glu 


Thr He Glu Leu 






130 




135 


Arg Arg 


Thr 


Arg 


Lvs Ser He Asn 






145 




150 


Pro 


Asn 


Glu 


Leu 


Glu Lys Leu He 






160 




165 


Asp Arg 


Glu 


Arg 


Ala Val Val Glu 






175 




180 


Glu 


Val 


Asn 


Leu 


Lvs Leu Gin Asn 






190 




195 


Cys 


Cys 


Asn 


His 


Pro Tvi" Leu Tie 

f X x</ XJf X J-I^V-L XX^ 






205 




210 


Thr 


Gin 


Glu 


Phe 


r,vs Tie A c«t> Olil 

AV jr O AXCS xlO U1U 






220 




225 


Lys 


Phe 


Leu 


Tie 

A A w 


T»en Acj"d Aitt Met 
jjcu jn,o^f my vies i> 






235 




240 


Gly His 


Lys 


Val 


Leu Leu Phe Ser 






250 




255 


He 


Leu 


Met 


Asp 


Tyr Cys His Leu 






265 




270 


Leu 


Met 


Gly 


Pro 


Cys Leu Thr Gin 






280 




285 


Ala 


Ser 


Thr 


Arg 


He Gin Arg Cys 






295 




300 



<210> 329 
<211> 72 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: LG: 1089383 . 1 .orf 2 : 2000FEB18 
<400> 329 



Thr 


Ala Leu Leu 


Leu 


Thr Gin Ser Leu Phe Gly Ser Leu Phe Thr 


1 




5 


10 


15 


Trp 


Thr Arg Val 


Thr 


Phe Gly Ala Glu Asp 


Pro Gly Gin Glu Asp 






20 


25 


30 


Ser 


Phe Arg Arg 


Arg 


Val Pro Cys Pro Cys 


Pro His Ser Val Arg 






35 


40 


45 


Arg 


Ser Thr Tyr 


Asp 


Leu Arg Ser Ser Asp 


Gin Pro Ala Gin Gly 






50 


55 


60 


Thr 


Ser His Glu 


Phe 


Gin He Gly Phe Pro 


Thr He 






65 


70 





<210> 330 

<211> 76 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> mi sc_f eature 

<223> Incyte ID No: LG: 1092522 . 1 . orf 2 : 2000FEB18 
<400> 330 

Phe Ser Tyr Leu Ser Ser Lys Trp Val Val Lys Gin Gin Arg Gin 
1 5 10 .15 

Leu Ala He Ser Thr Met His Leu Ala Gin Glu Leu Leu Met Asn 
20 25 30 
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Val Gin Cys Ser Gly Gly Ser Arg His Phe Ser Lys Glu MetTTlrg 
35 40 45 

Thr Leu Lys Met Arg Ser lie Val .Ala Lys Pro Leu Glu Val Asp 
50 55 60 

Asn Asp Gin Leu Arg Ala lie Ser Lys Ala Asp Pro Leu Lys Ala 
65 70 75 

Thr 



<210> 331 

<211> 74 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: LG: 1093216 . 1 .orf 2 :2000FEB18 
<220> 

<221> unsure 

<222> 2, 17, 36 

<223> unknown or other 

<400> 331 

Gly Xaa Pro Pro Thr Thr Ser Gly Pro Gin Thr Asn Gin Pro Lys 
1 5 10 15 

Glu Xaa Leu Met Asn Phe Lys Ser Asp Ser Gin Leu Tyr Glu Asp 

20 25 30 

Thr Leu Ala Gly Arg Xaa Val Leu lie Lys Asn Leu Thr Pro Gin 

35 40 45 

Thr Leu Gin Pro Arg Trp Thr Gly Pro Tyr Leu Val lie Tyr Ser 

50 55 60 

Thr Pro Thr Ala Val Arg Leu Gin Asp Pro Pro His Trp Val 

65 70 



<210> 332 , , 

<211> 67 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: LI : 270318 . 3 . orf 3 : 2000FEB01 



<400> 332 

Leu Ser Phe Lys Arg Asp Ser Trp Glu Tyr Gly His Pro Ala Pro 
15 10 15 

Arg Cys Gly Asn Glu Ser Ser Arg Ser Gly Glu Ala Ala Leu Ala 

20 25 30 

Asp Val Gin Leu Ala Ala Pro Val Ser Asn Gin Leu His Pro Asp 

35 40 45 

Gly Val Glu Asp Arg Gly Val Gly Gly Leu Leu Arg Ser Tyr Thr 

50 55 60 

Thr Gin Leu Thr Met Asn lie 

65 



<210> 333 
<211> 192 
<212> PRT 
<213> Homo sapiens 



<220> 

<221> misc_f eature 

<223> Incyte ID No: LI : 33 5671 . 2 . orf 2 : 2000FEB01 
<400> 333 

Trp Val His Val Leu Leu Arg Glu Arg Lys Lys His Ala Gin Leu 
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1 














10 






15 


Gin 


His 


Gly Ser 


Arg 


Gly 


Val 


Tyr Leu 


Leu 


Val 


Ser Thr Arg 


Ala 








20 








25 






30 


Gly 


Gly 


Leu Gly 


He 


Asn 


Leu 


Thr Ala 


Ala 


Asp 


Thr Val He 


He ' 








35 








40 






45 


Tyr 


Asp 


Ser Asp 


Trp 


Asn 


Pro 


Gin Ser 


Asp 


Leu 


Gin Ala Gin 


Asp 








50 








55 






60 


Arg 


Cys 


His Arg 


He 


Gly 


Gin 


Thr Lys 


Pro 


Val 


Val Val Tyr 


Arg 






65 






70 






75 


Leu 


Val 


Thr Ala 


Asn 


Thr 


He 


Asp Gin 


Lys 


He 


Val Glu Arg 


Ala 








80 






85 




90 


Ala 


Ala 


Lys Arg 


Lys 


Leu 


Glu 


Lys Leu 


He 


He 


His Lys Asn 


His 








95 








100 






105 


Phe 


Lys 


Gly Gly 


Gin 


Ser 


Gly 


Leu Asn 


Leu 


Ser 


Lys Ash Phe 


Leu 








110 




• 


115 






120 


Asp 


Pro 


Lys Glu 


Leu 


Met 


Glu 


Leu Leu 


Lys 


Ser 


Arg Asp Tyr 


Glu 








125 








130 






135 


Arg 


Glu 


He Lys 


Gly 


Ser 


Arg 


Glu Lys 


Val 


He 


Ser Asp Lys 


Asp 








140 








145 






150 


Leu 


Glu 


Leu Leu 


Leu 


Asp 


Arg 


Ser Asp 


Leu 


He 


Asp Gin Met 


Asn 








155 


160 




165 


Ala 


Ser 


Gly Pro 


He 


Lys 


Glu 


Lys Met 


Gly 


He 


Phe Lys He 


Leu 








170 








175 






180 


Glu 


Asn 


Ser Glu 


Asp 


Ser 


ser 


Pro Glu 


Cys 


Leu 


Phe 










185 








190 









<210> 334 

<211> 74 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI: 793758. 1 -orf2 :2000FEB01 
<220> 

<221> unsure 
<222> 36 

<223> unknown or other 
<400> 334 

Gly Asp Pro Pro Thr Thr Ser Gly Pro Gin Thr Asn Gin Pro Lys 
15 10 15 

Glu His Leu Met Asn Phe Lys Ser Asp Ser Gin Leu Tyr Glu Asp 

20 25 30 

Thr Leu Ala Gly Arg Xaa Val Leu He Lys Asn Leu Thr Pro Gin 

35 40 45 

Thr Leu Gin Pro Arg Trp Thr Gly Pro Tyr Leu Val He Tyr Ser 

50 55 ' 60 

Thr Pro Thr Ala Val Arg Leu Gin Asp Pro Pro His Trp Val 

65 70 

<210> 335 

<211> 72 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: LI : 803718 .1 .orf 2 :2000FEB01 
<220> 

<221> unsure 
<222> 41 

<223> unknown or other 
<400> 335 
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WO 01/62927 PCT7US0 1/06059 

'hr Gin Ser Leu Phe Gly Ser Leu Phe^Th 



Thr Ala Leu Leu Leu Thr Gin Ser Leu Phe Gly Ser Leu Phe^Thr 

1 5 10 15 . 

Trp Thr Arg Val Thr Phe Gly Ala Glu Asp Pro Gly Gin Glu Asp 

20 i 1 25 30 

Ser Phe Arg Arg Arg Val Pro Cys Pro Cys Xaa His Ser Val Arg 

35 .40 45 

Arg Ser Thr Tyr Asp Leu Arg Ser Ser Asp Gin Pro Ala Gin Gly 

50 , 55 60 

Thr Ser His Glu Phe Gin lie Gly Phe Pro Thr He 

65 70 

<210> 336 

<211> 55 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc — feature 

<223> Incyte ID No: LI : 412179 . 1 . orf 2 : 2000FEB01 
<400> 336 

Thr He Glu Met Met Leu Asp lie Lys Gin He Gin Val He Phe 
1 * 5 10 15 

Leu Phe Glu Phe Lys Met Gly Arg Lys He Ala Glu Thr Thr Arg 

20 25 30 

Asn He Asp Asn Ala Phe Gly Pro Gly Leu Leu Thr Asn Val Gin 

35 40 45 

Cys Ser Gly Ser Ser Arg Arg Gin Gly Ala 

50 55 

<210> 337 
<211> 107 
<212> PRT 
<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 815679 . 1 . orf 3 : 2000FEB01 



<400> 337 



Leu 


Arg 


Tyr 


He 


Asn 


Gly 


Ser 


Met 


Ser 


Ser 


Leu 


Tyr 


Pro Arg 


Leu 


1 








5 








10 








15 


Cys 


His 


Leu 


Ser 


Leu 


Gin 


Phe 


Leu 


Pro 


Leu 


Lys 


Asn 


Arg Ser 


He 








20 










25 








30 


Phe 


Leu 


Gin 


Ser 


Leu 


Met 


Leu 


Gly 


Phe 


Glu 


Leu 


Cys 


Leu Ala 


Leu 










35 










40 








45 


Ala 


Thr 


Gly 


He 


Leu 


He 


Cys 


Met 


Thr 


Lys 


Asn 


Leu 


Glu Ser 


Val 








50 








55 








60 


Asn 


Ser 


Phe 


Val 


Leu 


Ala 


His 


Ser 


Cys 


Tyr 


His 


His 


Glu Asn Lys 










65 








70 








75 


Pro 


Arg 


Pro 


Gly Cys 


Cys 


Phe 


Gin 


Gin 


Lys 


He 


Arg Asp Thr 


Lys 










80 








85 








90 


Asn 


Lys 


Val 


Glu 


Leu 


Pro 


Arg 


His 


Ala 


His 


Ala 


Arg 


Leu Thr 


Asn 



95 100 105 

Pro Gin 



<210> 338 
<211> 147 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> incyte ID No: LI : 481361 . 3 . orf 3 : 2000FEB01 
<400> 338 
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Pt*o 
"X U 


t.qii rjiw" 




M»t" TtVcs Pro CZ~\ r» 


At*/t 


Asn 


Tlrr Ast> Tien 


1 






~j 




1 0 

XV 




15 


Tien 




T,vc Tipll 


Lys 


Cpr Met Ala Tien 

OCX ilC ill d JJCU 


Ala 


Asp 


Arcr Ala Val Phe 






20, 




25 




30 


Glu 


r.vs 


Glv Met 


Lys 


Ala Phe Val 'Ser 


xyx 


Val 


Gin Ala Tvr Ala 




•j ~j 








45 


i-Jjf o 


His 




Asn 


Tien Tie Phie Attt 

UCU JLJLCS XTXiC rtiy 


Leu 


Lys 


Asd Tien Asn Phe 






50 


Glv Plie Ala Lieu 


55 




60 


Ala 


OCX 


T.etl Ala 


Arg 


Leu 


Arg 


Met Pro Lvs Met 








65 




70 




75 


Pro 


Glu 


T.fyn Arcr 


Glv 


t.vs Gin Phe Pro 


Asp 


Phe 


Val Pro Val Asn 








80 




85 




90 


Val 


Asn 


Thr Asp 


Thr 


lie Pro Phe Lys 


Asp 


Lys 


1,1 e' Arg Glu Lys 








95 




100 




105 


Gin 


Arg 


Gin Lys 


Leu 


Leu Glu Gin Gin 


Arg 


Arg 


Glu Lys Thr Glu 








110 




115 




120 


Asn 


Glu 


Gly Arg 


Arg 


Lys Phe lie Lys 


Asn 


Lys 


Ala Trp Ser Lys 








125 




130 




135 


Gin 


Lys 


Ala Lys 


Lys 


Glu Lys Lys Lys 


Lys 


Met 


Asn 








140 




145 







<210> 339 
<211> 257 
<212> PRT 
<213> Homo sapiens 

<220> ' 
<221> mis cofeature 

<223> Incyte ID No: LG : 247388 . 1 . orfl : 2000MAY19 
<400> 339 

Gin Asn Ser Val Lys Leu Ala lie Leu Tyr Leu Met Thr Phe His 
15 10 15 

Leu Gin Ala Met Val Arg Ser Ala Gly Lys Leu Val Leu lie Asp 
20 25 30 

Lys Leu Leu Pro Lys Leu Lys Ala Gly Gly His Lys Val Leu lie 
35 40 45 

Phe Ser Gin Met Val Arg Cys Leu Asp lie Leu Glu Asp Tyr Leu 
50 55 , 60 

lie Gin Arg Arg Tyr Leu Tyr Glu Arg lie Asp Gly Arg Val Arg 
65 70 75 

Gly Asn Leu Arg Gin Ala Ala lie Asp Arg Phe Ser Lys Pro Asp 
80 85 90 

Ser Asp Arg Phe Val Phe Leu Leu Cys Thr Arg Ala Gly Gly Leu 
95 100 105 

Gly lie Asn Leu Thr Ala Ala Asp Thr Cys lie lie Phe Asp Ser 
110 115 120 

Asp Trp Asn Pro Gin Asn Asp Leu Gin Ala Gin Ala Arg Cys His 
125 130 135 

Arg lie Gly Gin Ser Lys Ala Val Lys Val Tyr Arg Leu lie Thr 
140 145 150 

Arg Asn Ser Tyr Glu Arg Glu Met Phe Asp Lys Ala Ser Leu Lys 
155 160 165 

Leu Gly Leu Asp Lys Ala Val Leu Gin Ser Met Ser Gly Arg Asp 
170 175 180 

Gly Asn He Thr Gly He Gin Gin Phe Ser Lys Lys Glu He Glu 
185 190 195 

Asp Leu Leu Arg Lys Gly Ala Tyr Ala Ala He Met Glu Glu Asp 
200 205 210 

Asp Glu Gly Ser Lys Phe Cys Glu Glu Asp He Asp Gin He Leu 
215 220 225 

Leu Arg Arg Thr Thr Thr He Thr He Glu Ser Glu Gly Lys Gly 
230 235 240 

Ser Thr Phe Ala Lys Ala Ser Phe Val Ala Ser Glu Asn Arg Thr 
245 250 255 

Asp He 
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WO 01/62927 



PCT/USO 1/06059 



<210> 340 
<211> 63 
<212> PRT 

<213> Homo sapiens 



LG:255789.10.orf3 :2000MAY19 



lie Leu Phe Ser His Phe Pro , He Gin Met 
10 15 

Leu Val Pro Leu Leu Pro Ser Lys Thr He 
,25 30 

Arg Gly Gin Val Phe Leu Met Leu Leu Arg 
40 45 

Ala His Asp Leu Leu Val Lys Gly Ala Gly 
55 60 



LI: 787618. l.orfl: 200 0MAY01 



<220> 

<221> misc_feature 
<223> Incyte ID No: 

<400> 340 

Leu Lys Glu Leu Thr 
1 5 
Lys Thr Ala Ser Phe 
20 

Leu Phe Asp Arg Ala 
35 

Lys Pro Ser He Thr 
50 

Lys Tyr Lys 



<210> 341 

<211> 112 , 

<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<223> Incyte ID No: 

<400> 341 



Gly 


Thr 


Leu 


Met 


Asp 


1 








5 


He 


Cys 


Leu Cys 


Ser 










20 


Ser 


Pro 


Ala 


Prq 


Asn 










35 


His 


Trp 


Asn 


He 


Asn 








50 


Phe 


He 


Cys 


Ala 


His 








65 


Gin 


Ala 


Arg Leu 


Leu 










80 


Gin 


Ser 


Gin 


Ala 


Ser 










95 


Ser 


Ala 


Arg 


Pro 


He 








110 



<210> 342 
<211> 427 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc__f eature 
<223> Incyte ID No: 

<400> 342 



Thr 


Arg 


Thr 


Thr 


Thr 


1 






5 


Phe 


Arg 


Cys 


Thr 


Arg 
20 


Cys 


Ser 


Pro 


Arg 


He 








35 


Tyr 


Ser 


He 


Leu 


Pro 








50 


Tyr 


Val 


Ser 


Val 


Asn 



Pro 


Phe 


Pro 


Pro 


Cys 










10 


Ser 


Ser 


Pro 


Leu 


Lys 










25 


Val 


Ala 


Phe 


Ser 


His 










40 


Leu 


His 


Asp- Gin 


Lys 










55 


Ser 


Leu 


Leu 


Leu 


Ser 










70 


Leu 


Pro 


Ala 


Glu 


Asp 










85 


Gin 


Thr 


Arg 


Ser 


Cys 










100 


Ala 


Pro 









He 


Gin 


Asp 


Ser 


Ala 








15 


Asn 


Arg 


Glu 


Tyr 


He 
30 


Met 


Ser 


Ser 


Phe Gly 










45 


Leu 


Gly Lys 


Cys 


Ala 










60 


Pro 


Lys 


Glu 


Gin 


Ala 








75 


Lys 


Arg 


His 


Gin 


clu 
90 


Gin 


He 


Asn 


Gin 


Ser 
105 



LI: 33 1610 .2 .orf3 :2000MAY01 



Arg Leu Gly Ser Pro Lys Gly Ser Thr Cys 

10 15 
Thr Thr Ser Arg Glu Ser Thr Trp Ala Leu 

25 30 
Pro Thr Trp Ala Lys Asn Gly Thr Val Ser 

40 45 
Ser His He Gly Asp Val Ser He Tyr Thr 

55 60 
Pro Thr Asn Gly Ala He Tyr Ala Leu Arg 
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WO 01/62927 4 


m 












P 








65 




70 








75 


Ser 


Phe 


Asn Phe 


Glu 


Gin Thr Lys Ala 


Phe 


Glu 


Phe Lys 


Val 


Leu 








80 


85 








90 , 


Ala 


Lvs 


Asp Ser 


Gly 


Ala Pro Ala His 


Leu 


Glu 


Ser Asn 


Ala 


Thr 




95 




100 








105 


Val 


Arg 


Val Thr 


Val 


Leu Asp Val Asn 


Asp 


Asn 


Ala Pro 


Val 


He 






110 


115 








120 


Val 


Leu 


Pro Thr 


Leu 


Gin Asn Asp Thr 


Ala 


Glu 


Leu Gin Val 


Pro 








125 


130 








135 


Arg 


Asn 


Ala Gly 


Leu 


Gly Tyr Leu Val 


Ser 


His 


Cys Ala Arg 


Pro 








140 




145 








150 


Arg 


Gin 


Arg Leu 


Arg 


Arg Arg Ala Gly 


Val 


Ser 


Pro Thr 


Lys 


He 








155 




160 


» 






165 


Val 


Asp 


Gly Asn 


Asp 


Asp His Leu Phe 


Glu 


He 


Asp Pro 


Ser 


Ser 






170 




175 








180 


Gly 


Glu 


He Arg 


Thr 


Leu His Pro Phe 


Trp 


Glu 


Asp Val Thr 


Pro 






185 




190 








195 


Val 


Val 


Glu Leu 


Val 


Val Lys Val Thr 


Asp 


His 


Gly Lys 


Pro 


Thr 








200 


205 






210 


Leu 


Ser 


Ala Val 


Ala 


Lys Leu He He 


Arg 


Ser 


Val Ser Gly 


Ser 








215 


220 








225 


Leu 


Pro 


Glu Gly 


Val 


Pro Arg Val Asn 


Gly 


Glu 


Gin His 


His 


Trp 






230 


235 








240 


Asp 


Met 


Ser Leu 


Pro 


Leu lie Val Thr 


Leu 


Ser 


Thr He 


Ser 


He 






245 




250 








255 






T.an >1 a 

JJcU nm 


Ala 


Met He Thr He 


Ala 


Val 


Lys Cys 


Lys 


Arg 








260 




265 






270 






T^vra f2l 1 1 
uyo vjiu 


He 


Arg Thr Tyr Asn 


Cys 


Arg 


He Ala 


Glu 


Tyr 








275 




280 








285 


Cor 


nx & 




Leu 


Gly Gly Gly Lys 


Gly 


Lys 


Lys Lys 


Lys 


He 








290 




295 








300 


2X en 


XI Jf © 


t\s>ii nop 


He 


Met Leu Val Gin 


Ser 


Glu 


Val Glu 


Glu 


Arg 






305 




310 








315 


Asn 


niu 


Met Asn 


Val 


Met Asn Val Val 


Ser 


Ser 


Pro Ser 


Leu 


Ala 








320 




325 








330 


XXIX. 


Cor 


£r 1. \J vlxs L. 


Tyr 


Phe Asp Tyr Gin 


Thr 


Arg 


Leu Pro 


Leu 


Ser 








335 


340 








345 


OCX 


Pro 


niy OcX 


Glu 


Val Met Tyr Leu 


Lys 


Pro 


Ala Ser 


Asn 


Asn 








350 


355 








360 


Levi 


Thr 


Val Pro 


Gin 


Gly His Ala Gly 


Cys 


His 


Thr Ser 


Phe 


Thr 








365 


370 








375 


Gly 


Gin 


Gly Thr 


Asn 


Ala Ser Glu Thr 


Pro 


Ala 


Thr Arg Met 


Ser 




380 




385 








390 


He 


He 


Gin Thr 


Asp 


Asn Phe Pro Ala 


Glu 


Pro 


Asn Tyr Met 


Gly 








395 




400 








405 


Ser 


Arg 


Gin Gin 


Phe 


Val Gin Ser He 


Ser 


Val 


Ala Pro 


Arg 


Leu 






410 




415 








420 


Arg 


Thr 


Gin Lys 


Glu 


Pro Ala 






i 












425 















PCT/US01/06059 



<210> 343 
<211> 144 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LG: 982697 . 1 .orf 2 :2000FEB18 
<220> 

<221> unsure 
<222> 52, 56 
<223> unknown or other 

<400> 343 

Gly Ser He Glu Gly Lys Cys Gly Val Gly Gly Ser Asn Arg Val 
15 10 15 
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WO 01/62927 PCT/US01/06059 

>r^Asn Glv Thr lie Arg Ser Gly Pr^Le 



Thr Ala Gly Ala Leu Pro Asn Gly Thr lie Arg Ser Gly Pro^Leu 
20 25 30 , 

Pro Ser Arg Pro Lys Asp Asp Arg Ser Thr Ser Ser Leu Tyr Ser 
35 i 40 45 

Ala Pro Gly Lys Ala Thr Xaa Thr Gin Leu Xaa Pro Met Ser Ala 
■ 50 .55 60 

Ala Leu Asp Ser Leu Pro Cys Lys Ala lie Gly Ala Gly Leu Leu 
65 70 75 

Lys Ala Trp Gly Ala His Pro Leu Tyr Gin Cys Gly Leu Asp Val 
80 85 90 

Glu His Asp Val Glu Asp Tyr Phe Gly Thr Leu Arg Phe Ser Asp 
95 100 105 

Phe Pro Thr Gly Phe Trp Ser Cys Val Asp Pro Val Asp Pro Phe 
110 115 120 

Phe Trp Pro lie Ser Pro Phe Leu Gly Trp Lys His Leu Pro Asn 
125 130 135 

Pro Tyr Thr Pro lie Val Ser Trp Lys 
140 



<210> 344 
<211> 97 
<212> PRT 
. <213> Homo sapiens 



<220> 

<221> misc_f eature 

<223> Incyte ID No: LG: 1080896 . 1 . orf 2 :2000FEB18 
<400> 344 



Lys 


Lys 


Leu 


Val 


Cys 


He 


Met 


Leu Phe 


Tyr 


Pro Arg Tyr Val 


Ala 


1 








5 








10 




15 


Gin 


GlU 


lie 


Thr 


Asn 


Leu 


Met 


Arg Val 


Met 


Thr His Phe Lys 


Pro 










20 






25 




30 


Pro 


Met 


He 


Thr 


Val 


Phe 


Lys 


He Gin 


Leu 


Arg Thr Gly Pro 


Phe 










35 






40 




45 


Leu 


Gly 


Asp 


Thr 


Arg 
50 


Lys 


GlU 


He Ala 


Arg 
55 


Thr Glu Glu Lys 


Gly 
60 


Val 


Pro 


Lys 


Asn 


Val 


Thr 


Leu 


Lys Phe 


Asp 


Ala Cys Ala Thr 


He 








65 








70 




75 


Asp 


Ser 


Lys 


Gin 


His 
80 


Gly 


He 


Gly Cys 


Gly 
85 


Ser Leu Asn Trp 


Lys 
90 


Lys 


Lys 


Leu 


His 


Ser 
95 


Arg 


Lys 











<210> 345 
<211> 75 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> mis cofeature 

<223> Incyte ID No: LI : 81 1341 . 1 . orf 3 : 2000FEB01 



<400> 345 












Gly Leu 


Phe 


Gin 


Cys 


He 


His 


Gin 


1 






5 








Val Ala 


Thr 


Val 


Leu 


Leu 


Phe 


Tyr 








20 






Thr Leu 


Lys 


He 


Thr 
35 


Cys 


Leu 


Tyr 


Cys Ser 


Pro 


Gly 


Ser 
50 


Asp 


Gin 


Pro 


Glu Pro 


Pro 


Met 


Thr 
65 


Thr 


Val 


Phe 



Val Thr 


Glu 


Val Gly Gin Lys 


10 






15 


Gly Tyr 


Tyr 


Lys cys 


Thr Gly 


25 






30 


Asn Val 


He 


Leu Tyr 


Lys Val 


40 






45 


Asp Val 


Cys 


Tyr Asp 


Pro Ser 


55 






60 


Lys He 


Arg 


Leu Arg 


Thr Glu 


70 






75 



<210> 346 
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• 



<211> 135 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 903225 . 1 . orf 3 :2000FEB01 
<400> 346 

Asp Pro Phe Gin' Lys Met Ala Pro Lys Val Lys Lys Glu Ala Pro 

15 10 15 

Gly Pro Pro Lys Ala Glu Ala Lys Ala Lys Ala Leu Lys Ala Lys 

20 25 ; - 30 

Lys Val Val Leu Lys Gly Val His Gly His Lys Lys Lys Lys lie 

35 40 45 

Arg Met Ser Pro Thr Phe Gin Arg Pro ' Lys Thr Leu Arg Leu Trp 

50 55 60 

Arg Pro Pro Arg Tyr Pro Arg Lys Thr Thr Pro Arg Arg Asn Lys 

65 70 75 
Leu Asp His Tyr Ala lie He Lys Phe Pro Leu Thr Thr Glu Phe 

80 85' 90 

Ala Met Lys Lys He Lys Asp Asn Asn Thr Leu Val Phe Thr Val 

95 100 105 

Asp Val Lys Ala Asn Lys His Gin' He Lys Gin Ala Val Lys Lys 

liO ■ 115 120 
Leu Cys Asp He Asp Gly Ala Lys Val Asn Thr Leu Met Glu Arg 

125 130 135 



<210> 347 

<211> 55 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 242079 .2 .orf 2 :2000FEB01 
<400> 347 

Ser Ala Arg He Glu Ala Trp Glu Pro Pro Thr Arg Phe Gin Ser 
1 5 10 15 

Met Cys Gly Lys Ala Trp Met Ser Arg Gin Lys Pro Ala Ala Gly 

20 25 30 

Thr Glu Pro Ser Trp Arg Thr Ser Thr Arg Val Val Trp Arg Gly 

35 40 45 

Asn Leu Gly Leu Glu Phe Pro His Ser Phe 

50 55 



<210> 348 
<211> 129 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 979580 . 1 .orf 2 : 2000MAY19 
<400> 348 



Trp 


Lys 


Val Asn Gly 


Arg Asn Leu Ser Pro Phe 


Glu Glu He 


Gly 


1 








5 


10 




15 


Asn 


Gin 


Ser 


His 


Phe 


Val Ala Gin Ala Gly Val 


Gin Trp His 


Asn 










20 


25 




30 


Leu 


Ala 


His 


cys 


Asn 


His His Leu Pro Gly Ser 


Ser Asp Pro 


Pro 










35 


40 


45 


Thr 


Ser 


Thr 


Ser 


Gin 


Val Ala Gly Ser Ala Gly 


Val Arg His 


His 










50 


55 




60 


Thr 


Arg 


Leu 


He 


Phe 


Val Phe Leu Val Gin Lys 


Glu Phe His 


His 
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BNSDOCID: <WO_0162927A2LL> 



WO 01/62927 



Val Asp Gin Ala 
Thr Trp Ala Ser 
Ser Pro Ala Tyr 
Asn Glu Ala Phe 



<210> 349 

<211> 291 

i <212> PRT 

<213> Homo sapiens 



<220> 

<221> mis cofeature 

<223> Incyte ID No: LI : 1169865 . 1 . orf 2 : 2000MAY01 



<400> 349 






















Arcr 


Pro 


Thr 


Val 


Ser 


Val 


Ser Cys 


Ala 


Ser 


Ser 


Arg Pro 


Gin 


Phe 


1 








5 






10 








15 


Leu 


He 


Thr 


Val 


Pro 


Val 


Leu Thr Val 


He 


Asn 


Tyr Arg 


Pro 


His 










20 








25 






30 


Asn 


Met 


Ara 


Pro 


Glu Asp 


Arg Met 


Phe 


His 


He 


Arg Ala Val 


He 








35 








40 








45 


Leu 


Ara 


Ala 


Leu 


Ser 


Leu 


Ala Phe 


Leu 


Leu 


Ser 


Leu Arg Gly Ala 








50 








55 








60 


Gly 


Ala 


He 


Lvs 


Ala Asp 


His Val 


Ser 


Thr 


Tyr 


Ala Ala 


Phe 


Val 






65 








70 








75 


Gin 


Thr 


His 


Arg 


Pro 


Thr 


Gly Glu 


Phe 


Met 


Phe 


Glu Phe 


Asp 


Glu 








80 






85 








90 


Asp 


Glu 


Met 


Phe 


Tyr 


Val 


Asp Leu 


Asp 


Lys 


Lys 


Glu Thr 


Val 


Trp 








95 








100 








105 


His 


Leu 


Glu 


Glu 


Phe 


Gly 


Gin Ala 


Phe 


Ser 


Phe 


Glu Ala 


Gin 


Gly 










110 






115 








120 


Gly 


Leu 


Ala 


Asn 


He 


Ala 


He Leu 


Asn 


Asn 


Asn 


Leu Asn 


Thr 


Leu 








125 








130 








135 


He 


Gin 


Arg 


Ser 


Asn 


His 


Thr Gin 


Ala 


Thr 


Asn 


Asp Pro 


Pro 


Glu 








140 








145 








150 


Val 


Thr 


Val 


Phe 


Pro 


Lys 


Glu Pro 


Val 


Glu 


Leu 


Gly Gin 


Pro 


Asn 










155 






160 








165 


Thr 


Leu 


He 


Cys 


His 


He 


Asp Lys 


Phe 


Phe 


Pro 


Pro Val 


Leu 


Asn 








170 






175 








180 


Val 


Thr 


Trp 


Leu 


Cys 


Asn 


Gly Glu 


Leu 


Val 


Thr 


Glu Gly Val 


Ala 








185 








190 








195 


Glu 


Ser 


Leu 


Phe 


Leu 


Pro 


Arg Thr 


Asp 


Tyr 


Ser 


Phe His 


Lys 


Phe 










200 








205 








210 


His 


Tyr 


Leu 


Thr 


Phe 


Val 


Pro Ser 


Ala 


Glu 


Asp 


Phe Tyr Asp 


Cys 








215 








220 








225 


Arg 


Val 


Glu 


His 


Trp Gly 


Leu Asp 


Gin 


Pro 


Leu 


Leu Lys 


His 


Trp 








230 






235 








240 


Glu 


Ala 


Gin 


Glu 


Pro 


He 


Gin Met 


Pro 


Glu 


Thr 


Thr Glu 


Thr 


Val 










245 








250 








255 


Leu 


Cys 


Ala 


Leu 


Gly 


Leu 


Val Leu 


Gly 


Leu 


Val 


Gly He 


He 


Val 








260 








265 








270 


Gly 


Ser 


Val 


Leu 


He 


He 


Lys Ser 


Leu 


Arg 


Ser 


Gly His 


Asp 


Pro 








275 






280 








285 


Arg 


Ala 


Gin 


Gly Thr 


Leu 
























290 



















<210> 350 
<211> 517 
<212> PRT 
<213> Homo sapiens 

<220> 



183/228 



PCT7US01/060S9 



65 70 75 

Gly Leu Lys Leu Leu Thr Ser Ser Asp Trp Pro . 

80 85 90 

Gin Ser Ala Gly He Thr Gly Val Ser His Cys 

95 100 105 

Glu Val Val Phe Ala Val Lys Gin Gin Phe Gly 
110 115 120 

Leu Arg Ser Ser Val 
125 



BNSDOCID: <WO 01 62927 A2J_> 
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<221> misc_feat 

<223> Incyte ID No: LG:337818 . 2 .orf 1 :2000FEB18 

i 

<400> 350 

Arg Gly Ala Trp Glu Arg Arg Arg Ser Arg Pro Ala Glu Met Glu 
1 5 10 .15 

Ala Thr Gly Thr Trp Ala Leu Leu Leu Ala Leu Ala Leu Leu Leu 

,20 , 25 30 

Leu Leu Thr Leu Ala Leu Ser Gly Thr Arg Ala Arg Gly His Leu 
35 40 45 

Pro Pro Gly Pro Thr Pro Leu Pro Leu Leu Gly Asn Leu Leu Gin 
50 55 60 

Leu Arg Pro Gly Ala Leu Tyr Ser Gly Leu Met Arg 1 Leu Ser Lys 
65 70 75 

Lys Tyr Gly Pro Val Phe Thr lie Tyr Leu Gly Pro Trp Arg Pro 
80 1 85 90 

Val Val Val Leu Val Gly Gin Glu Ala Val Arg Glu Ala Leu Gly 
95 100 105 

Gly Gin Ala Glu Glu Phe Ser Gly Arg Gly Thr Val Ala Met Leu 

110 115 120 

Glu Gly Thr Phe Asp Gly His Gly Val Phe ■ Phe Ser Asn Gly Glu 

125 130 135 

Arg Trp Arg Gin Leu. Arg Lys Phe 1 Thr Met Leu Ala Leu Arg Asp 

140 145 150 

Leu Gly Met Gly L^s Arg Glu Gly Glu Glu Leu lie Gin Ala Glu 

155 160 165 

Ala Arg Cys Leu Val Glu Thr Phe Gin Gly Thr Glu Gly Arg Pro 

170 175 180 

Phe Asp Pro Ser Leu Leu Leu Ala Gin Ala Thr Ser Asn Val Val 

185 190 195 

Cys Ser Leu Leu Phe Gly Leu Arg Phe Ser Tyr Glu Asp Lys Glu 

200 205 210 

Phe Gin Ala Val Val Arg Ala Ala Gly Gly Thr Leu Leu Gly Val 

215 220 225 

Ser Ser Gin Gly Gly Gin Thr Tyr Glu Met Phe Ser Trp Phe Leu 

230 ^ 235 240 

Arg Pro Leu Pro Gly Pro His Lys Gin Leu Leu His His Val Ser 
< 245 250 255 

Thr Leu Ala Ala Phe Thr Val Arg Gin Val Gin Gin His Gin Gly 

260 265 270 

Asn Leu Asp Ala Ser Gly Pro Ala Arg Asp Leu Val Asp Ala Phe 

275 280 285 

Leu Leu Lys Met Ala Gin Glu Glu Gin Asn Pro Gly Thr Glu Phe 

290 295 300 

Thr Asn Lys Asn Met Leu Met Thr Val lie Tyr Leu Leu Phe Ala 

305 310 315 

Gly Thr Met Thr Val Ser Thr Thr Val Gly Tyr Thr Leu Leu Leu 

320 325 330 

Leu Met Lys Tyr Pro His Val Gin Lys Trp Val Arg Glu Glu Leu 

335 340 345 

Asn Arg Glu Leu Gly Ala Gly Gin Ala Pro Ser Leu Gly Asp Arg 

350 355 360 

Thr Arg Leu Pro Tyr Thr Asp Ala Val Leu His Glu Ala Gin Arg 

365 370 375 

Leu Leu Ala Leu Val Pro Met Gly lie Pro Arg Thr Leu Met Arg 

380 385 390 

Thr Thr Arg Phe Arg Gly Tyr Thr Leu Pro Gin Gly Thr Glu Val 

395 400 ~ 405 

Phe Pro Leu Leu Gly Ser lie Leu His Asp Pro Asn lie Phe Lys 

410 415 420 

His Pro Glu Glu Phe Asn Pro Asp Arg Phe Leu Asp Ala Asp Gly 

425 430 435 

Arg Phe Arg Lys His Glu Ala Phe Leu Pro Phe Ser Leu Gly Lys 

440 445 450 

Arg Val Cys Leu Gly Glu Gly Leu Ala Lys Ala Glu Cys Phe Leu 

455 460 465 

Phe Phe Thr Thr lie Leu Gin Ala Phe Ser Leu Glu Ser Pro Cys 
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470 



475 



480 



Pro Pro Asp Thr Leu, Ser Leu Lys Pro Thr Val Ser Gly Leu Phe 

485 490 495 

Asn lie Pro Pro Ala Phe Gin Leu Gin Val Arg Pro Thr Asp Leu 

500 505 510 
His Ser Thr Thr Gin Thr Arg 



<210> 351 
<211> 232 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LI : 337818 . 1 . orf 1 : 2000FEB01 
<400> 351 



Leu 


Oor 

*-> <=JL 


Thr 


Asp 


He 


Asn 


He 


Val 


His 


Leu 1 


Gin 


Ser 


Gin 


Ala 


His 


1 






5 










10 










15 




Gin 




Thr 


His 


Gin 


Leu 


Pro' 


Trp 


lie 


Cys 


Ser 


Pro 


His 


Val 










20 










25 










30 






± j.p 


Leu 


Ser 


Pro 


Ser 


Gin 


Ala. 


Thr 


Glu 


Thr 


Ala 


Thr 


His 






35 










40 




Thr 






45 


Val 


His 


Ser 


Ser 


His 


Ala 


Leu 


Ser 


He 


His 


Arg 


Ser 


Gin 


Cys 










50 










55 










60 


Pro 


Cys 


Pro 


Trp 


Cys 


Leu 


Ala 


Gin Gly 


Thr 


Ala 


Cys 


Pro 


Leu 


Arg 








65 










70 










75 


Gly 


His 


Ala 


Thr 


Gin 


Arg 


Leu 


Ser 


Leu 


Ser 


Met 


Ala 


Pro 


Thr 


His 








80 








85 










90 


Ala 


Pro 


Ser 


Leu Gly 


Tyr 


Thr 


Thr 


Leu 


Pro 


Ala 


Cys 


Asp 


His 


Arg 










95 








100 










105 


Cys 


Pro 


His 


Thr 


Pro 


Asn 


His 


Leu 


Ser 


Thr 


Gin 


Leu 


Pro 


Thr 


Tyr 








110 










115 










12 0 


Asp 


He 


Val 


Leu 


Ala 


Pro 


Gin 


Ser 


He 


Phe 


Pro 


Leu 


Arg 


His 


Ala 








125 










130 










135 


Ala 


Pro 


Thr 


Glu 


Ala 


Gin 


Ser 


Pro 


Ala 


Thr 


Ser 


Ala 


Thr 


Ala 


Ala 










140 










145 










150 


Leu 


Ser 


His 


Pro 


Phe 


Leu 


Ser 


Thr 


Leu 


He 


Leu 


Pro 


Asn 


Ala 


Asn 










155 










160 










165 


Thr 


Ser 


Gly 


Ser 


Ala 


He 


Met 


His 


Arg 


Asp 


Phe 


Gly 


His 


Thr 


Arg 








170 










175 










180 


Thr 


Leu 


Arg 


Pro 


Glu 


Glu 


His 


Leu 


Pro 


Asn 


Pro 


Asn 


Thr Cys 


Leu 








185 










190 










195 


Cys 


Asn 


His 


Val 


Glu 


Ser 


Gly 


Pro 


Cys 


Cys 


Pro 


Ser 


Thr 


His 


Thr 








200 










205 










210 


Tyr 


Thr 


Leu 


Thr 


Asp 


Leu 


Gin 


Pro 


Leu 


Phe 


Gly 


Val 


Arg Val 


Pro 








215 










220 










225 


Thr 


Arg 


Pro 


Ser 


Gly 


Arg 


Gly 



















<210> 352 
<211> 220 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> mis cofeature 

<223> Incyte ID No: LG:241577 . 4 .orf 1 :2000MAY19 
<400> 352 

Gly Lys Val Glu Val Glu Asp Glu Gly Cys Thr Ala Gin Lys Ala 
15 10 15 

Pro Val Arg Pro Gly Leu Leu Pro Pro Cys Leu Thr Pro Glu He 
20 25 30 

Gly Ala Gly Val Pro Ser Ala Gly Cys Pro Leu Cys Pro Ser Met 



515 



230 



35 



40 



45 
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Pro 


Pro 


Trp 


All 


Is 


Ser Tyr His 


Thr 


Ser 


His 


Val^Fo Met He 










50 






55 




60 


Pro 


Leu 


Leu 


Gly Pro 


Arg Ser Ser 


Phe 


Ser 


Arg 


Lys Trp Ser Ala 










65 






70 




75 ■ 


Arg 


Ala 


Arg 


Gly Gly 


Gly Lys Met 


Ser 


Pro 


Tyr 


Thr Asn Cys Tyr 










80 






85 




90 


Ala 


Gin 


Arg 


Tvr 


Tvr 


Pro Met Pro 


Glu 


Glu 


Pro 


Phe Cys Thr Glu 










95 






100 




105 


Leu 


Asn 


Ala 


Glu 


GlU 


Gin Ala Leu 


Lys 


Glu 


Lys 


Glu Lys Gly Ser 










110 






115 




120 


Trp 


Thr 


Gin 


Leu 


Thr 


His Ala Glu 


Lys 


Val 


Ala 


Leu Tyr Arg Leu 










125 






130 




135 


Gin 


Phe 


Asn 


Glu 


Thr 


Phe Ala Glu 


Met 


Asn 


Arg 


Arg Ser Asn Glu 










140 






145 




150 


Trp 


Lys 


Thr 


Val 


Met 


Gly Cys Val 


Phe 


Phe 


Phe 


He Gly Phe Ala 










155 






160 




165 


Ala 


Leu 


Val 


He 


Trp 


Trp Gin Arg 


Val 


Tyr 


Val 


Phe Pro Pro Lys 










170 






175 




180 


Pro 


He 


Thr 


Leu 


Thr 


Asp Glu Arg 


Lys 


Ala 


Gin 


Gin Leu Gin Arg 










185 






190 




195 


Met 


Leu 


Asp 


Met 


Lys 


Val Asn Pro 


Val 


Gin 


Gly 


Leu Ala Ser Arg 










200 






205 


210 


Trp 


Asp 


Tyr 


Glu 


Lys 


Lys Gin Trp 


Lys 


Lys 







215 220 

<210> 353 

<211> 95 

<212> PRT 

<213> Homo sapiens 

,<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 344786 . 4 .orfl : 2000MAY19 
<400> 353 



Pro 


He 


Leu Trp 


Ser 


Val Leu Ser 


Phe 


Ser 


He 


Glu Leu Cys 


Phe 


1 




5 






10 






15 


Cys 


Cys 


Phe He 


Leu 


Ser Leu Leu 


Cys 


Val 


Phe 


Cys Pro He 


Leu 








20 






25 






30 


Cys 


Ser 


Arg His 


Gin 


Glu Pro Arg 


His 


Pro 


Pro 


Pro Val Thr 


Tyr 








35 






40 






45 


Ser 


Ser 


Glu Pro 


Ala 


Arg Ser Leu 


Phe Arg 


Met 


He Thr Trp Arg 








50 






55 






60 


Ser 


Leu 


Arg Lys 


Leu 


Leu Lys Asn 


Thr 


Leu 


Val 


Pro Ser Leu 


Ser 








65 






70 






75 


Gly 


Leu 


Gly Pro 


Phe 


Arg His Phe 


Ser 


Val 


Ser 


Met Thr Gin 


Thr 






80 






85 






90 



Met Gin Arg His Phe 
95 

<210> 354 
<211> 331 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 414307 . 1 . orf 2 : 2000FEB01 
<220> 

<221> unsure 
<222> 191 

<223> unknown or other 
<400> 354 

Gly Met Gly Lys Leu Cys Leu Gly Pro Thr Leu Cys Pro Ala Ala 
1 5 10 15 
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Ala 


Ala 


Glu 


GlU 


Ser 


Arg 


Asp 


Ala Glu 


Pro 


Arg 


Arg Glu Leu Leu 










20 






25 




30 


Cys 


Ser 


Gly Arg 


Pro 


Trp 


Thr 


Trp Arg 


Ala 


Ala 


Ala Arg Val Thr 








35 






40 




45 


Thr 


Met 


He 


Pro 


Trp 


Val 


Leu 


Leu Ala 


Cys 


Ala 


Leu Pro Cys Ala 










50 








55 




60 


Ala 


Asp 


Pro 


Leu 


Leu 


Gly 


Ala 


Phe Ala 


Arg 


Arg 


Asp Phe Arg Lys 








65 






70 




75 


Gly 


Ser 


Pro 


Gin 


Leu 


Val 


Cys 


Ser Leu 


Pro 


Gly 


Pro Gin Gly Pro 








80 






85 




90 


Pro 


Gly 


Pro 


Pro 


Gly 


Ala 


Pro 


Gly Pro 


Ser Gly 


Met Met Gly Arg 








95 






100 




105 


Met. 


Gly 


Phe 


Pro 


Gly 


Lys 


Asp 


Gly Gin 


Asp Gly 


His Asp Gly Asp 








110 






115 




120 


Arg 


Gly 


Asp 


Ser 


Gly 


Glu 


Glu 


Gly Pro 


Pro 


Gly 


Arg Thr Gly Asn 










125 








130 




. . 135 


Arg 


Gly 


Lys 


Pro 


Gly 


Pro 


Lys 


Gly Lys 


Ala Gly 


Ala He Gly Arg 




140 








145 




150 


Ala 


Gly 


Pro 


Arg 


Gly 


Pro 


Lys 


Gly Val 


Asn Gly 


Thr Pro Gly Lys 










155 








160 




165 


His 


Gly 


Thr 


Pro 


Gly 


Lys 


Lys 


Gly Pro 


Lys 


Gly 


Lys Lys Gly Glu 








170 






175 




180 


Pro 


Gly 


Leu 


Pro 


Gly 


Pro 


Cys 


Ser cys 


Gly Xaa 


Gly His Thr Lys 








185 








190 




195 


Ser 


Ala 


Phe 


Ser 


Val 


Ala 


Val 


Thr Lys 


Ser 


Tyr 


Pro Arg Glu Arg 










200 








205 




210 


Leu 


Pro 


He 


Lys 


Phe 


Asp 


Lys 


He Leu 


Met 


Asn 


Glu Gly Gly His 










215 








220 




225 


Tyr 


Asn 


Ala 


Ser 


Ser 


Gly 


Lys 


Phe Val 


Cys 


Gly 


Val Pro Gly He 








230 








235 




240 


Tyr 


Tyr 


Phe 


Thr 


Tyr 


Asp 


He 


Thr Leu 


Ala 


Asn 


Lys His Leu Ala 






245 






250 




255 


lie 


Gly 


Leu 


Val 


His 


Asn 


Gly 


Gin Tyr 


Arg 


He 


Arg Thr Phe Asp 








260 








265 




270 


Ala 


Asn 


Thr 


Gly 


Asn 


His 


Asp 


Val Ala 


Ser 


Gly 


Ser Thr He Leu 








275 








280 




285 


Ala 


Leu 


Lys 


Gin 


Gly 


Asp 


Glu 


Val Trp 


Leu 


Gin 


He Phe Tyr Ser 








290 








295 




300 


Glu 


Gin 


Asn Gly 


Leu 


Phe 


Tyr 


Asp Pro 


Tyr 


Trp 


Thr Asp Ser Leu 










305 








310 




315 


Phe 


Thr 


Gly 


Phe 


Leu 


He 


Tyr 


Ala Asp 


Gin Asp 


Asp Pro Asn Glu 








320 








325 




330 



Val 



<210> 355 

<211> 93 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: LI : 202943 . 2 . orf 3 : 2000FEB01 
<400> 355 



Asn 


Val 


Arg 


Leu 


Gin 


Glu 


He Arg 


Lys 


Met 


Asp 


Ser 


Ser 


Gly Glu 


1 






5 








10 








15 


He 


Tyr 


Thr 


Val 


Gly 


Ala 


Glu Val 


Thr 


Phe 


Ser 


Cys 


Gin 


Glu Gly 








20 








25 








30 


Tyr 


Gin 


Leu 


Met 


Gly 


Val 


Thr Lys 


He 


Thr 


Cys 


Leu 


Glu 


Ser <31y 








35 






40 








45 


Glu 


Trp 


Asn 


His 


Leu 


He 


Pro Tyr 


Cys 


Lys 


Gly 


Met 


Phe 


Ser Lys 








50 








55 








60 


Phe 


Thr 


Thr 


Phe 


Leu 


Met 


Phe Gly Asn 


Pro 


Arg 


Lys 


Val 


Arg Arg 










65 








70 








75 


Arg 


His 


Met 


Lys 


Cys 


Tyr Val Val 


Ser 


Met 


Phe 


Phe 


Val 


Phe Glu 










80 








85 








90 
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Leu Arg Tyr 



<210> 356 
<211> 112 
<212> PRT 

<213> Homo sapiens 

<220> ' 
<221> misc_feature 

<223> Incyte ID N6: LI:246194.2 . orf 1 : 2000FEB01 
<220> 

<221> unsure 

<222> 25, 28, 52, 112 

<223> unknown or other 

<400> 356 



Ser Ser Thr 


His 


His 


Arg Arg Ser Thr Gly Ala Pro Thr 


Pro 


Gly 


1 




5 


10 




15 


Leu Pro Pro 


Pro 


Pro 


Pro Ala Thr Arg Xaa . Ser Cys Xaa 


Ala 


Ala 






20 


25 




30 


Ser Ala Ala 


Pro 


Gly 


Pro Gly Ala- Ala Pro Val Gly Ala 


Pro 


Thr 






35 


40 




45 


Pro Ala Ser 


Thr 


Thr 


Cys Xaa Val Leu Ala Arg Thr Thr 


Ser 


Pro 






50 


55 




60 


Ser Gly Ser 


Thr 


Ala 


Arg Ser Asp Ala Ala Glu Arg Gly Ser 


Pro 




65 


70 




75 


Gly Pro Gly 


Pro 


Pro 


Ala Gly Pro Ala Ala Gin Arg His 


Gly 


Gly 






80 


85 




90 


Arg His Pro 


His 


Gly 


Ser His Leu Asn Pro Gin His Pro 


He 


Lys 




95 


100 




105 


Phe Leu Phe 


Asn 


Thr 


Lys Xaa 










110 








<210> 357 












<211> 73 












<212> PRT 












<213> Homo sapiens 








<220> 












<221> misc_feature 








<223> Incyte ID 


No: 


LI : 815961 . 1 . orf 3 :2000FEB01 






<400> 357 












Cys Val Glu 


Glu Val 


Cys Glu Cys Lys Asp Val Glu Val Leu 


Pro 


1 




5 


10 




15 


Val Leu Asn 


Glu 


Leu 


Ala Asp Gly Leu Val Pro Leu Val 


Val 


Thr 






20 


25 




30 


Val He Gly 


Gly Ala 


Val Trp Val Asp Pro Val Thr Leu 


Ser 


Val 




35 


40 




45 


Val Ser Gly 


Gly Met 


Val Pro Val Gly Val Glu Trp Met 


Glu 


Ala 






50 


55 




60 


Glu Val Asp 


He 


Cys 


Ala Trp Val Gly Val Met Thr Leu 










65 


70 







<210> 358 
<211> 239 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> mis cofeature 

<223> Incyte ID No: LG: 120744 . 1 . orf 1 .-2000MAY19 
<220> 

<221> unsure 
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<222> 199 

<223> unknown or other 
<400> 358 



Asn 


Glu 


Asn 


Arg 


Asn 


Met 


Ala 


His Tyr He Thr 


Phe Leu Cys Met 


1 








5 






10 


15 


Val 


Leu 


Val 


Leu 


Leu 


Leu 


Gin 


Asn Ser Val Leu 


Ala Glu Asp Gly 










20 






25 


30 


Glu 


Val 


Arg 


Ser 


Ser 


Cys 


Arg 


Thr Ala Pro Thr 


Asp Leu Val Phe 








35 


40 


45 


lie 


Leu 


Asp 


Gly 


Ser 


Tyr 


Ser 


Val Gly Pro Glu 


Asn. Phe Glu He 






50 




55 


60 


Val 


Lys 


Lys 


Trp 


Leu 


Val 


Asn 


He Thr Lys Asn 


Phe Asp He Gly 






65 






, 70 


75 


Pro 


Lys 


Phe 


lie 


Gin 


Val 


Gly 


Val Val Gin Tyr 


Ser Asp Tyr Pro 








80 






85 


90 


Val 


Leu 


Glu 


He 


Pro 


Leu 


Gly 


Ser Tyr Asp Ser 


Gly Glu His Leu 










95 




100 


105 


Thr 


Ala 


Ala 


Val 


Glu 


Ser 


He 


Leu Tyr Leu , Gly 


Gly Asn Thr Lys 










110 






115 


120 


Thr 


Gly 


Lys 


Ala 


He 


Gin 


Phe 


Ala, Leu Asp Tyr 


Leu Phe Ala Lys 






125 






130 


135 


Ser 


Ser 


Arg 


Phe 


Leu 


Thr 


Lys 


He Ala Val Val 


Leu Thr Asp Gly 










140 




' 145 


150 


Lys 


Ser 


Gin 


Asp 


Asp 


Val 


Lys 


Asp Ala Ala Gin 


Ala Ala Arg Asp 










155 






160 


165 


Ser 


Lys 


lie 


Thr 


Leu 


Phe 


Ala 


He Gly Val Gly 


Ser Glu Thr Glu 








170 






175 


180 


Asp 


Ala 


Glu 


Leu 


Arg 


Ala 


He 


Ala Asn Lys Pro 


Ser Ser Thr Tyr 








185 






190 


195 


Val 


Phe 


Tyr 


Xaa 


Glu 


Asp 


Tyr 


He Ala He Ser 


Lys He Arg Glu 








200 






205 


210 


Val 


Met 


Lys 


Gin 


Lys 


Leu 


Cys 


Glu Glu Ser Val 


Cys Pro Thr Arg 








215 






220 


225 


lie 


Pro 


Val 


Ala 


Ala 


Arg 


Asp 


Glu Arg Gly Phe 


Asp He Leu 










230 






235 





<210> 359 
<211> 528 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: LI : 757520 . 1 . orfl : 2000MAY01 
<400> 359 



Ala 


Ser 


His 


Ser Ala 


Ala 


Arg 


Thr 


Thr 


Phe 


Arg 


Cys 


Leu 


Ser 


Asp 


1 






5 








10 










15 


Ser 


Ala 


Gin 


Pro His 


Thr 


Met 


Ser 


Cys 


Arg 


Ser 


Tyr 


Arg 


He 


Ser 








20 








25 










30 


Ser 


Gly 


Cys 


Gly Val 


Thr 


Arg 


Asn 


Phe 


Ser 


Ser 


Cys 


Ser 


Ala 


Val 




35 








40 










45 


Ala 


Pro 


Lys 


Thr Gly 


Asn 


Arg 


Cys 


Cys 


He 


Ser 


Ala 


Ala 


Pro 


Tyr 






50 










55 










60 


Arg 


Gly 


Val 


Ser Cys 
65 


Tyr 


Arg 


Gly 


Leu 


Thr 
70 


Gly 


Phe 


Gly 


Ser 


Arg 
75 


Ser 


Leu 


Cys 


Asn Leu 
80 


Gly 


Ser 


Cys 


Gly 


Pro 
85 


Arg 


He 


Ala 


Val 


Gly 
90 


Gly 


Phe 


Arg Ala Gly 


Ser 


Cys 


Gly 


Arg 


Ser 


Phe 


Gly Tyr Arg 


Ser 








95 










100 










105 


Gly 


Gly 


Val 


Cys Gly 


Pro 


Ser 


Pro 


Pro 


Cys 


He 


Thr 


Thr 


Val 


Ser 






110 










115 










120 


Val 


Asn 


Glu 


Ser Leu 
125 


Leu 


Thr 


Pro 


Leu 


Asn 
130 


Leu 


Glu 


He 


Asp 


Pro 
135 


Asn 


Ala 


Gin 


Cys Val 


Lys 


Gin 


Glu 


Glu 


Lys 


GlU 


Gin 


He 


Lys 


Ser 








140 








145 










150 



189/228 
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Leu 


Asn 


Ser 


Arg* 


™e 


Ala 


Ala 


Phe 


He 


Asp 


Lys 


Val^Pg Phe Leu 










155 










160 


165 


Glu 


Gin 


Gin 


Asn 


Lys 


Leu 


Leu 


Glu 


Thr 


Lys 


Trp 


Gin Phe Tyr Gin 










170 










175 




180 1 


Asn 


Gin 


Arg 


Cys 


Cys 


Glu 


Ser 


Asn 


Leu 


GlU 


Pro 


Leu Phe Ser Gly 










185 








1 


190 




195 


Tyr 


He 


GlU 


Thr 


Leu 


Arg 


Arg 


GlU 


Ala 


GlU 


Cys 


Val Glu Ala Asp 










200 










205 




210 


Ser 


Gly 


Arg 


Leu 


Ala 


Ser 


Glu 


Leu 


Asn 


His 


Val 


Gin Glu Val Leu 










215 










220 




225 


Glu 


Gly 


Tyr 


Lys 


Lys 


Lys 


Tyr 


Glu 


Glu 


GlU 


Val 


Ala Leu Arg Ala 










230 










235 




240 


Thr 


Ala 


Glu 


Asn 


Glu 


Phe 


Val 


Val 


Leu 


Lys 


Lys 


Asp Val Asp Cys 










245 










250 


255 


Ala 


Tyr 


Leu 


Arg 


Lys 


Ser 


Asp 


Leu 


Glu 


Ala 


Asn 


Val Glu Ala Leu 










260 










265 




270 


Val 


Glu 


Glu 


Ser 


Ser 


Phe 


Leu 


Arg 


Arg 


Leu 


Tyr 


Glu Glu Glu He 










275 






280 


285 


Arg 


Val 


Leu 


Gin 


Ala 


His 


He 


Ser 


Asp 


Thr 


Ser 


Val He Val Lys 










290 










295 




300 


Met 


Asp 


Asn 


Ser 


Arg 


Asp 


Leu 


Asn 


Met 


Asp 


Cys 


He He Ala Glu 










305 










310 




315 


He 


Lys 


Ala 


Gin 


Tyr 


Asp 


Asp 


Val 


Ala 


Ser 


Arg 


Ser Arg Ala Glu 










320 










325 




330 


Ala 


Glu 


Ser 


Trp 


Tyr 


Arg 


Ser 


Lys 


Cys 


Glu 


Glu 


Met Lys Ala Thr 










335 










340 




345 


Val 


He 


Arg 


His 


Gly 


Glu 


Thr 


Leu 


Arg 


Arg 


Thr 


Lys Glu Glu He 










350 










355 




360 


Asn 


Glu 


Leu 


Asn 


Arg 


Met 


He 


Gin 


Arg 


Leu 


Thr 


Ala Glu He Glu 










365 










370 




375 


Asn 


Ala 


Lys 


cys 


Gin 


Arg 


Ala 


Lys 


Leu 


Glu 


Ala 


Ala Val Ala Glu 






380 






385 




390 


Ala 


Glu 


Gin 


Gin 


Gly 


Glu 


Ala 


Ala 


Leu 


Ser 


Asp 


Ala Arg Cys Lys 










395 










400 




405 


Leu 


Ala 


Glu 


Leu 


Glu 


Gly 


Ala 


Leu 


Gin 


Lys 


Ala 


Lys Gin Asp Met 










410 










415 




420 


Ala 


Cys 


Leu 


Leu 


Lys 


Glu 


Tyr 


Gin 


Glu 


Val 


Met 


Asn Ser Lys Leu 










425 










430 




435 


Gly 


Leu 


Asp 


He 


Glu 


He 


Ala 


Thr 


Tyr 


Arg 


Arg 


Leu Leu Glu Gly 










440 










445 




450 


Glu 


Glu 


His 


Arg 


Leu 


Cys 


Glu 


Gly 


Val 


Gly 


Ser 


Val Asn Val Cys 










455 










460 




465 


Val 


Ser 


Ser 


Ser 


Arg 


Gly 


Gly 


Val 


Ser 


Cys 


Gly 


Gly Leu Ser Tyr 










470 










475 




480 


Ser 


Thr 


Thr 


Pro 


Gly 


Arg 


Gin 


He 


Thr 


Ser 


Gly 


Pro Ser Ala He 










485 








490 


495 


Gly 


Gly 


Ser 


He 


Thr 


Val 


Val 


Ala 


Pro 


Asp 


Ser 


Cys Ala Pro Cys 










500 










505 




510 


Gin 


Pro 


Arg 


Ser 


Ser 


Ser 


Phe 


Ser 


Cys 


Gly 


Ser 


Ser Arg Ser Val 










515 








520 




525 


Arg 


Phe 


Ala 





















<210> 360 
<211> 157 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> incyte ID No: LG : 160570 . l.orf 2 :2000PEB18 
<220> 

<221> unsure 
<222> 150 

<223> unknown or other 
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<400> 360 



Val 


GlU 


Gly Ala 


Ala 


Gly 


Arg 


Leu 


Gly 


Gly 


Ser 


Phe 


Leu 


Ser 


Trp 


1 




5 








10 










15 


Leu 


Glu 


Thr Thr 


Thr 


Thr 


Asp 


Met 


Gly 


Cys 


Phe 


Cys 


Ala 


Val 


Pro 








20 








25 










30 


GlU 


Glu 


Phe Tyr 


Cys 


Glu 


Val 


Leu 


Leu 


Leu 


Asp 


GlU 


Ser 


Lys 


Leu 






35 










40 










45 


Thr 


Leu 


Thr Thr 


Gin 


Gin 


Gin 


Gly 


He 


Lys 


Lys 


Ser 


Thr 


Lys 


Gly 








50 








55 










60 


Ser 


Val 


Val Leu 


Asp 
65 


His 


Val 


Phe 


His 


His 
70 


Val 


Asn 


Leu 


Val 


Glu 
75 


lie 


Asp 


Tyr Phe 


Gly 


Leu 


Arg 


Tyr 


Cys 


Asp 


Arg 


Ser 


His 


Gin 


Thr 






80 










85 










90 


Tyr 


Trp 


Leu Asp 


Pro 


Ala 


Lys 


Thr 


Leu 


Ala 


Glu 


His 


Lys 


Glu 


Leu 


95 










100 










10b 


He 


Asn 


Thr Gly 


Pro 


Pro 


Tyr 


Thr 


Leu 


Tyr 


Phe Gly 


He 


Lys 


Phe 






110 










115 










120 


Tyr 


Ala 


Glu Asp 


Pro 
125 


Cys 


Lys 


Leu 


Lys 


Glu 
130 


Glu 


He 


Thr 


Arg 


Tyr 
135 


Ser 


He 


Asp Phe 


Val 


Phe 


Glu 


Gin 


lie 


His 


Ala 


Leu Arg 


He 


Xaa 






140 










145 










150 


Lys 


Ala 


Leu Phe 


Lys 


Thr 


Asn 



















155 

<210> 361 

<211> 65 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LI:350398 . 3 .orf 3 :2000FEB01 
<220> 

<221> unsure 

<222> 22-23, 56, 65 

<223> unknown or other 



<400> 361 

Gly Leu Pro Lys Pro Gly Ala Leu Val Gly Asp Arg Ala Ala Pro 
15 10 15 

Ala Trp Val Gin Pro Pro Xaa Xaa Gin Val Asn Arg Phe His Lys 

20 25 30 

He Arg Asn Arg Ala Leu Leu Leu Thr Asp Gin His Leu Tyr Lys 

35 40 45 

Leu Asp Pro Asp Arg Gin Tyr Arg Val Met Xaa Ala Val Pro Leu 

50 ~ 55 , 60 

Glu Ala Val Thr Xaa 

65 

<210> 362 
<211> 517 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 221285 . 1 . orf 3 : 200 0FEB01 
<400> 362 

Leu Ala Ala Arg Gly Val Leu Ser Arg Gly Gin Pro Gly Ser Ala 
15 10 15 

Ala Ala Pro Arg Gin Glu Lys Gin Pro Arg Thr Pro Trp Lys Ser 

20 25 30 

Arg Ser Thr Trp Ala Cys Glu Asn Gly Ala Asn Thr Ser Pro Ala 

35 40 45 

Lys Pro His Ser Lys Ala Gly Pro Arg Thr Ala Thr Val Ala Pro 
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50 55 60 

Gin Ala Ser Arg Met Thr Val' Lys Pro Ala Lys Ala Ala Ser Leu 
65 ' ( 70 75 

Ala Arg Asn Leu Ala Lys Arg Arg Arg Thr Tyr Leu Gly Gly Ala 
80 85 90 

Ala Gly Arg Ser Gin Glu Pro Glu Val Pro Cys Ala Ala Val Leu 
95 100 105 

Pro Gly Lys Pro ,Gly Asp Arg Asn Cys Pro Glu Phe Pro Pro Pro 
' , 110 ' 115 120 

Asp Arg Thr Leu- Gly Cys Trp Ala Thr Asp Ala Ala Pro Ala Ala 
125 130 135 

Gly Leu Cys Gly Ala Gly Ser Glu Pro Ser lie Ala Pro Thr Ser 
140 145 ' 150 

Cys Ala Gly Asn Leu Pro Ser Arg Pro Pro Pro Leu Leu Ser Pro 
155 160 165 

Leu Leu Ala Ser Arg Asn Pro Cys Pro' Trp His Tyr Leu His Leu 
170 175 180 

Ser Gly Ser His Asn Thr Leu Ala Pro Thr Cys Phe Lys Ala Lys 
185 190 195 

Leu His Arg Lys Arg Gly Ser Gin Pro Pro Asp Met Ala Ser Ala 

200 205' 210 1 

Leu Thr Asp Arg Thr Ser Arg Ala Pro Ser Thr Tyr Thr Tyr Thr 
215 220 225 

Ser Arg Pro Arg Ala Leu Pro Cys 1 Gin Arg Ser Arg Tyr Arg Asp 
2*30 . 235 240 

Ser Leu Thr Gin Pro Asp Glu Glu Pro Met His Ty* Gly Asn lie 
245 250 255 

Met Tyr Asp Arg Arg Val lie Arg Gly Asn Thr Tyr Ala Leu Gin 
260 265 270 

Thr Gly Pro Leu Leu Gly Arg Pro Asp Ser Leu Glu Leu Gin Arg 
275 280 285 

Gin Arg Glu Ala Arg Lys Arg Ala Leu Ala Arg Lys Gin Ala Gin 
290 295 300 

Glu Gin Leu Arg Pro Gin Thr Pro Glu Pro Val Glu Gly Arg Lys 
305 310 315 

His Val Asp Val Gin Thr Glu Leu Tyr Leu Glu Glu lie Ala Asp 
320 325 330 

Arg lie lie Glu Val Asp Met Glu Cys Gin Thr Asp Ala Phe Leu 
335 - 340 345 

Asp Arg Pro Pro Thr Pro Leu Phe lie Pro Ala Lys Thr Gly Lys 
350 355 360 

Asp Val Ala Thr Gin lie Leu Glu Gly Glu Leu Phe Asp Phe Asp 
365 370 375 

Leu Glu Val Lys Pro Val Leu Glu Val Leu Val Gly Lys Thr lie 
380 385 390 

Glu Gin Ser Leu Leu Glu Val Met Glu Glu Glu Glu Leu Ala Asn 
395 400 405 

Leu Arg Ala Ser Gin Arg Glu Tyr Glu Glu Leu Arg Asn Ser Glu 
410 415 420 

Arg Ala Glu Val Gin Arg Leu Glu Glu Gin Glu Arg Arg His Arg 
425 430 435 

Glu Glu Lys Glu Arg Arg Lys Lys Gin Gin Trp Glu He Met His 
440 445 450 

Lys His Asn Glu Thr Ser Gin Lys He Ala Ala Arg Ala Phe Ala 
455 460 465 

Gin Arg Tyr Leu Ala Asp Leu Leu Pro Ser Val Phe Gly Ser Leu 
470 475 480 

Arg Asp Ser Gly Tyr Phe Tyr Asp Pro He Glu Arg Asp He Glu 
'485 ~ 490 495 

He Gly Phe Leu Pro Trp Leu Met Asn Glu Val Glu Lys Thr Met 
500 505 510 

Glu Tyr Ser Met Val Gly Arg 
515 

<210> 363 
<211> 60 
<212> PRT 
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<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: LI : 401605 . 2 . oirf 2 : 2000FEB01 
<400> 363 



Ala Glu Arg 


Gly , Leu 


Arg 


Thr Leu 


Leu 


Ser Leu Glu Asp 


Glu Arg 


1 




5 








10 


15 


Met Cys His 


Ser 


Gly 


Gin 


Met Gly 


Ser 


Leu Leu Gly Thr 


Val Cys 






20 








25 


30 


Ser Glu Ser 


Val 


Pro 


Ser 


Thr Pro 


Lys 


Lys Pro Pro Lys 


Ser Trp 






35 








40 


45 


Ala Ser Leu 


Trp 


Asn 


Gin 


Gin lie 


Leu, His Phe Gly Ala 


Tyr Lys 






50 








55 


60 



<210> 364 
<211> 239 
<212> PRT 
<213> Homo sapiens 

<220> i 
<221> mi sc_f eature 

<223> Incyte ID No: LI : 32 9 017 . 1 . orf 1 : 2000FEB01 
<400> 364 



lie 


Tyr 


Thr 


Glu 


Val 


Glu 


Gin Leu 


Gly 


Trp 


Lys 


Leu Tyr 


Gly Asp 


1 






5 








10 








15 


Lys 


Leu 


Ala 


Thr 


Ser 


Ser Gly Asp 


Thr 


Thr 


Val 


Lys Leu 


Trp 


Asp 










20 








25 








30 


Leu 


Cys 


Thr Gly 


Asp 


Cys 


He Leu 


Thr 


Phe 


Glu 


Gly His 


Ser Arg 








35 








40 








45 


Ala 


Val 


Trp 


Ser 


Cys 


Thr 


Trp His 


Ser 


Cys 


Gly 


Asn Phe 


Val 


Ala 










50 








55 








60 


Ser 


Ser 


Ser 


Leu 


Asp 


Lys 


Thr Ser 


Lys 


He 


Trp 


Asp Val 


Asn 


Ser 










65 






70 








75 


Glu 


Arg 


Cys 


Arg 


Cys 


Thr 


Leu Tyr Gly 


His 


Thr 


Asp Ser 


Val 


Asn 










80 








85 








90 


Ser 


lie 


GlU 


Phe 


Phe 


Pro 


Phe Ser 


Asn 


Thr 


Leu 


Leu Thr 


Ser 


Ser 










95 








100 








105 


Ala 


Asp 


Lys 


Thr 


Leu 


Ser 


He Trp 


Asp 


Ala 


Arg 


Thr Gly 


He 


Cys 








110 








115 








120 


GlU 


Gin 


Ser 


Leu 


Tyr 


Gly His Met 


His 


Ser 


He 


Asn Asp 


Ala 


He 










125 








130 








135 


Phe 


Asp 


Pro 


Arg 


Gly 


His 


Met He 


Ala 


Ser 


Cys 


Asp Ala 


Cys 


Gly 










140 








145 








150 


Val 


Thr 


Lys 


Leu 


Trp 


Asp 


Phe Arg 


Lys 


Leu 


Leu 


Pro He 


Val 


Ser 








155 








160 








165 


lie 


Asp 


He Gly 


Pro 


Ser 


Pro Gly Asn 


Glu 


Val 


Asn Phe 


Asp 


Ser 








170 








175 








180 


Ser 


Gly 


Arg Val 


Leu 


Ala 


Gin Ala 


Ser 


Gly 


Asn 


Gly Val 


He 


His 








185 








190 








195 


Leu 


Leu 


Asp 


Leu 


Lys 


Ser Gly Glu 


He 


His 


Lys 


Leu Met 


Gly His 








200 








205 








210 


Glu 


Asn 


Glu 


Ala 


His 


Thr 


Val Val 


Phe 


Ser 


His 


Asp Gly 


Glu 


He 










•215 








220 








225 


Leu 


Phe 


Ser Gly 


Gly 


Ser 


Asp Gly Thr 


Val 


Arg 


Thr Trp 


Ser 





230 235 

<210> 365 
<211> 160 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 
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WO 01/62927 PCI7US01/06059 
<223> Incyte hSBF: LI : 401322 . 1 . orfl : 2000FEB01 



<400> 365 



Ala Thr 


Gin Trp 


Glu 


Ser Pro Pro Gly 


Asp 


Thr 


Thr Gly Gly Ser 


1 




5 




10 




15 


Pro Gly 


lie Tyr 


Lys 


Val Pro Pro Ala 


Thr 


Ala 


Arg Trp Asp Ser 






20 




25 




30 


Trp Cys 


Cys Trp 


Arg 


Pro Val Trp Cys 


Leu 


Gin 


Ser Ala Thr Leu 


• 




35 




40 




45 


Gin Leu 


Gin Leu 


Leu 


Asp Gin Pro Cys 


Val 


Ser 


Ala Ser Pro Ser 






50 


55 




60 


Met Trp 


Ala Arg 


Pro 


Glu Cys Arg Trp 


Ala 


Met 


Pro Ala Gly Ser 






65 




70 




75 


Cys Thr 


Ala Trp 


Ser 


Thr Thr Ser Ser 


Pro 


Val 


Ala Pro Cys Pro 






80 




85 




90 


Ala Thr 


Arg Pro 


Trp 


Gly Ala Val lie 


Thr 


Pro 


Ser Thr Pro Ser 




95 


100 




105 


Ser Gly 


Arg Pro 


Ser 


Leu Ala Gly Met 


Cys 


Pro 


Gly Leu Ser Val 






110 




115 




120 


Asp Leu 


Glu Pro 


Ala 


Val lie Gly Trp 


His 


Gin 


Leu Pro Val Pro 






125 




130 




135 


His Ser 


Gly Ala 


Arg 


Gly Cys Cys Ser 


Gin 


Gly 


Ala Ala Gly Ser 






140 




145 




150 


Leu Arg 


Ala Lys 


Gin 


Tyr His Ser His 


His 








155 


160 







<210> 366 
<211> 757 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG:403409 . 1 .orf 3 :2000MAY19 
<400> 366 



Gly 


Arg Gly Arg Arg 


Lys Pro Asn Glu 


Phe 


Leu 


Gly Gly Cys Arg 


1 


5 




10 




15 


Met 


Gly Asp Ser Lys 


Val Lys Val Ala 


Val 


Arg 


He Arg Pro Met 




20 




25 




30 


Asn 


Arg Arg Glu Thr 


Asp Leu His Thr 


Lys 


Cys 


Val Val Asp Val 




35 




40 




45 


Asp 


Ala Asn Lys Val 


He Leu Asn Pro 


Val 


Asn 


Thr Asn Leu Ser 




50 




55 




60 


Lys 


Gly Asp Ala Arg 


Gly Gin Pro Lys 


Val 


Phe 


Ala Tyr Asp His 




65 




70 




75 


Cys 


Phe Trp Ser Met 


Asp Glu Ser Val 


Lys 


Glu 


Lys Tyr Ala Gly 




80 




85 




90 


Gin 


Asp He Val Phe 


Lys Cys Leu Gly 


Glu 


Asn 


He Leu Gin Asn 




95 


100 




105 


Ala 


Phe Asp Gly Tyr 


Asn Ala Cys He 


Phe 


Ala 


Tyr Gly Gin Thr 




110 




115 




120 


Gly 


Ser Gly Lys Ser 


Tyr Thr Met Met 


Gly 


Thr 


Ala Asp Gin Pro 




125 




130 




135 


Gly 


Leu He Pro Arg 


Leu Cys Ser Gly 


Leu 


Phe 


Glu Arg Thr Gin 




140 




145 




150 


Lys 


Glu Glu Asn Glu 


Glu Gin Ser Phe 


Lys 


Val 


Glu Val Ser Tyr 




155 




160 




165 


Met 


Glu He Tyr Asn 


Glu Lys Val Arg 


Asp 


Leu 


Leu Asp Pro Lys 




170 




175 




180 


Gly 


Ser Arg Gin Thr 


Leu Lys Val Arg 


Glu 


His 


Ser Val Leu Gly 




185 




190 




195 


Pro 


Tyr Val Asp Gly 


Leu Ser Lys Leu 


Ala 


Val 


Thr Ser Tyr Lys 




200 




205 




210 


Asp 


He Glu Ser Leu 


Met Ser Glu Gly 


Asn 


Lys 


Ser Arg Thr Val 




215 




220 


225 


Ala 


Ala Thr Asn Met 


Asn Glu Glu Ser 


Ser 


Arg 


Ser His Ala Val 
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230 



Phe 


Lys 


He 


Thr 


Leu 


Thr 










245 




Thr 


Ser 


Gly Glu 


Lys 


Val 










260 




Gly 


Ser 


Glu Arg Ala Thr 


• 






275 




Lys 


Glu 


Gly Ser Asn 


He 








290 




Val 


He 


Ser 


Ala 


Leu 


Ala 










305 




Lys 


Phe 


Val 


Pro 


Tyr 


Arg 


Asp 








320 




Ser 


Leu Gly Gly Asn 










335 




Ser 


Pro 


Ala Ala Asp Asn 










350 




Tyr 


Ala 


Asp Arg Ala 


Lys 










365 




Glu 


Asp 


Pro 


Asn 


Ala 


Arg 










380 




Glu 


Lys 


Leu 


Arg 


Glu 


Gin 










395 




Pro 


Glu 


Leu 


Lys 


Asp 


Arg 










410 




Glu 


Met 


Thr 


Val 


Thr 


Trp 










425 




lie 


Ala 


Gin 


Glu 


Arg 


Gin 










440 




Leu 


Gin 


Ser 


Ser 


Gly 


He 










455 




Val 


Asn 


Leu 


Asn 


Ala 


Asp 










470 


Tyr 


Leu 


Lys 


Glu 


His 


Thr 






485 




He 


Gin 


Leu 


Cys 


Gly Met 








500 




Asp 


He 


Thr 


Ser 


Glu 


Gly 










515 




Thr 


Arg 


Thr 


Phe 


Val 


Asn 










530 




Leu 


His 


His 


Gly Asp 


Arg 










545 




Arg 


Leu 


Asn 


Leu 


Pro 


Lys 










560 




Glu 


Asp 


Gin Asp 


Pro 


Ser 










575 




Leu 


Asp 


Val 


Asp 


Gly Asp 










590 




Asn 


Phe 


Asn Tyr 


Glu Tyr 










605 




Leu 


Gly 


Ser Asn Asp 


Pro 










620 




Gin 


Gin 


His 


Glu 


Glu 


Glu 










635 




Leu 


Met 


Tyr 


Glu 


His 


Glu 








650 




Pro 


Glu 


Lys 


Gin 


Asn 


Cys 








665 




Ser 


Pro 


Ser 


Ala 


Gin 


Gin 










680 




Glu 


Ala 


Thr 


Leu 


Asn 


Asn 










695 




Val 


Lys 


Ala 


Asn 


Leu 


Leu 








710 




Glu 


Leu 


Asp 


Lys 


Arg 


Thr 



725 



235 



His 


Thr 


Leu 


Tyr 


Asp Val 








250 






Gly 


Lys 


Leu 


Ser 


Leu 


Val 






265 






Lys 


Thr 


Gly 


Ala 


Ala Gly 








280 






Asn 


Lys 


Ser 


Leu 


Thr 


Thr 






295 






Asp 


Gin 


Ser 


Ala 


Gly Lys 








310 






Asp 


Ser 


Val 


Leu 


Thr 


Trp 






325 






Ser 


Lys 


Thr 


Ala 


Met 


Val 




Glu 


340 






Tyr 


Asp 


Thr 


Leu 


Ser 








355 






His 


He 


Val 


Asn 


His 


Ala 








370 






He 


He 


Arg 


Asp 


Leu Arg 








385 






Leu 


Thr 


Lys 


Ala 


Glu 


Ala 








400 






Leu 


Glu 


Glu 


Ser 


Glu 


Lys 








415 




GlU 


GlU 


Lys . 


Leu 


Arg 


Lys 








430 






Lys 


Gin 


Leu 


Glu 


Ser 


Leu 






445 






Lys 


Val 


Gly 


Asp 


Asp 


Lys 








460 






Pro 


Ala 


Leu 


Asn 


Glu 


Leu 








475 






Leu 


He 


Gly 


Ser 


Ala 


Asn 






490 






Gly 


He 


Leu 


Pro 


Glu 


His 






505 






Gin 


Val 


Met 


Leu 


Thr 


Pro 








520 






Gly 


Ser 


Ser 


Val 


Ser 


Ser 






535 






He 


Leu 


Trp 


Gly 


Asn 


Asn 






550 






Lys 


Lys 


Lys 


Lys 


Ala 


Glu 








565 






Met 


Lys 


Asn 


GlU 


Asn 


Ser 






580 






Ser 


Ser 


Ser 


Glu 


Val 


Ser 








595 






Ala 


Gin 


Met 


Glu 


Val 


Thr 








610 






Met 


Gin 


Ser 


He 


Leu 


Asn 








625 






Lys 


Arg 


Ser 


Ala 


Leu 


Glu 








640 






Leu 


Glu 


Gin 


Leu 


Arg 


Arg 








655 






Arg 


Ser 


Met 


Asp 


Arg 


Phe 








D / U 






Arg 


Leu 


Arg 


Gin 


Trp 


Ala 








685 






Ser 


Leu 


Met 


Arg 


Leu 


Arg 








700 






Val 


Arg 


Glu 


Ala 


Asn 


Tyr 








715 






Glu 


Tyr 


Lys 


Val 


Thr 


Leu 



730 
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^40 


Lys 


Ser 


Gly 






255 


Asp 


Leu 


Ala 






270 


Asp Arg 


Leu 






285 


Leu Gly 


Leu 






300 


Asn 


Lys 


Asn 






315 


Leu 


Leu 


Lys 






330 


Ala 


Thr 


Val 






345 


Thr Leu Arg 






360 


Val 


Val 


Asn 






375 


Glu 


Glu 


Val 






390 


Met 


Lys 


Ser 






405 


Leu 


He 


Gin 






420 


Thr 


Glu 


Glu 






435 


Gly 


He 


Ser 






450 


Cys 


Phe 


Leu 






465 


Leu 


Val 


Tyr 






480 


Ser 


Gin Asp 






495 


Cys 


He 


He 




510 


Gin Lys 


Asn 






525 


Pro 


He 


Gin 






540 


His 


Phe 


Phe 






555 


Arg 


Glu 


Asp 






570 


Ser 


Glu 


Gin 






585 


Ser 


Glu 


Val 






600 


Met 


Lys 


Ala 






615 


Ser 


Leu 


Glu 






630 


Arg Gin Arg 






645 


Arg 


Leu 


Ser 






660 


Ser 


Phe 


His 






675 


Glu 


Glu 


Arg 






690 


Glu 


Gin 


He 






705 


He 


Ala 


Glu 






720 


Gin 


He 


Pro 






735 



BNSDOCID: <WO 0162927A2_L> 
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Ala Ser Ser Leu^Ep Ala Asn Arg Lys Arg Gly ser ^Bu Leu Ser 
740 ' 745 750 

Glu Pro Ala lie Gin Val Arg ' 
755. 

<210> 367 

<211> 162 

<212> PRT ' 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 233933 . 5. orf3 :2000MAY1? 1 

<400> 367 ' 
Pro Arg Gin His Phe Cys Met Gin Tyr- Ser Leu Gin Val Val Thr 
1 .5 10 15 

Ala Thr Cys Lys Phe Gly Met Asn Ala Leu Leu Leu Ser Ala Trp 
20 25 30 

Phe Gly His Leu Arg lie Leu Gin lie Leu Val Asn Ser Gly Ala 1 

35 40* 45 1 

Lys lie His Cys Glu Ser Lys Glu Gly Asn Thr Ala Leu His Leu 
50. ■ 55 60 

Ala Ala Gly Arg Gly His Met Ala Val Leu Gin Arg Leu Val Asp 
•65 70 75 

lie Gly Leu Asp Leu Glu Glu Gin Asn Ala Glu Gly Leu Thr Ala 

80 85 90 ■ , 

Leu His Ser Ala Ala Gly Gly Ser His Pro Asp Cys Val Gin Leu 
95 100 105 

Leu Leu Arg Ala Gly Ser Thr Val Asn Ala Leu Thr Gin Lys Asn 
110 115 120 

Leu Ser Cys Leu His Tyr Ala Ala Leu Ser Gly Ser Glu Asp Val 
125 130 135 

Ser Arg Val Leu lie His Ala Gly Gly Cys Ala Asn Val Val Asp 
140 145 150 

His Gin Gly Ala Ser Pro Leu His Leu Ala Val Arg 
155 160 

<210> 368 
<211> 635 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: LI .-290344 . 1 . orf 2 :200 0MAY01 
<400> 368 

Ala Leu Val Phe Met Gin Pro Met Val Met Gin Gly Cys Pro Tyr 
15 10 15 

Thr Leu Pro Arg Cys His Asp Trp Gin Ala Ala Asp Gin Phe His 

20 25 30 

His Ser Ser Ser Leu Arg Ser Thr Cys Pro His Pro Gin Val Arg 

35 40 45 

Ala Ala Val Thr Ser Pro Ala Pro Pro Gin Asp Gly Ala Gly Val 

50 55 60 

Pro Cys Leu Ser Leu Lys Leu Leu Asn Gly Ser Val Gly Ala Ser 

65 70 75 

Gly Pro Leu Glu' Pro Pro Ala Met Asn Leu Cys Trp Asn Glu lie 

80 85 90 

Lys Lys Lys Ser His Asn Leu Arg Ala Arg Leu Glu Ala Phe Ser 

95 100 105 

Asp His Ser Gly Lys Leu Gin Leu Pro Leu Gin Glu lie He Asp 
110 115 120 

Trp Leu Ser Gin Lys Asp Glu Glu Leu Ser Ala Gin Leu Pro Leu 
125 130 • 135 

Gin Gly Asp Val Ala Leu Val Gin Gin Glu Lys Glu Thr His Ala 
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140 



Ala 


Phe 


Met Glu Glu 


Val 










155 




Val 


Leu 


Glu 


Ser 


Ala 


Gin 










170 




Glu 


Leu 


Glu 


Glu 


Pro 


His 










185 




Gin 


Arg 


He 


Gin Asn 


Leu 








200 




Val 


Ala 


Ser 


Glu 


Leu 


Trp 










215 


Gin 


His 


Arg His 


He 


Glu 










230 




Gin 


Gly 


Ala 


Met 


Glu 


Glu 








245 




Gly 


Val 


Arg Ala 


Thr 


Trp 










260 




Ser 


Leu 


Pro 


Glu 


His 


He 










275 




Phe 


Ser 


Pro 


Met 


Lys 


Asp 










290 




His 


Gin 


Leu 


Ala 


lie 


Ser 










305 




Gin 


Ala 


Leu 


Glu 


Gin 


He 










320 




Ser 


Val 


Asp 


Glu 


Arg 


Leu 








335 




Phe 


Gly 


Pro Gly 


Ser 


Gin 










350 




Pro 


Trp 


Glu Arg 


Ala 


He 










365 




Asn 


His 


Gin 


Ala 


Gin 


Thr 










380 




Glu 


Leu 


Tyr Gin 


Thr 


Leu 










395 




Ala 


Tyr 


Arg Thr 


Ala 


Met 










410 




Arg 


Leu 


Asp Leu Val 


Thr 










425 




Glu 


His 


Asp 


Leu 


Gin 


Ala 










440 




Val 


He 


His 


Cys 


Leu 


Thr 








455 




Arg 


Gly 


He 


Leu 


Val 


Asn 










470 




Asn 


Trp 


Leu 


Leu 


Asn 


Val 








485 




Arg 


Ala 


Leu 


Ser 


Phe 


Lys 










500 




Glu 


Val 


Lys 


Glu 


Lys 


Leu 








515 




Ser 


Gly 


Ser 


Gin 


Cys 


Asp 










530 




Glu 


Ala 


He 


Gin 


Val 


Pro 










545 




Gly 


Gly 


Ser 


Asn 


Val 


Glu 








560 




Ser 


Thr 


Gly 


Lys 


Pro 


Val 










575 




Val 


Asn 


Leu 


Glu 


Pro 


Gin 










590 




Arg 


Val 


Thr 


He 


Ala 


GlU 








605 




He 


Cys 


Ser 


Ala 


Val 


Pro 








620 




Glu 


Ala 


He Gin Arg 





635 



145 



Lys 


Ser Arg 


Ala 


Pro 


Tyr 






160 






Ala 


Phe 


Leu 


Ser 


Gin 


His 








175 






Ser 


Glu 


Ser 


Lys 


Asp 


Thr 








190 






Ser 


Arg 


Phe 


Val 


Trp 


Lys 








205 






Glu 


Lys 


Leu 


Thr 


Ala Arg 






220 






Arg 


Thr 


Leu 


Glu 


Gin 


Leu 






235 






Leu 


Ser 


Thr , Thr 


Leu 


Ser 








250 






GlU 


Pro 


He 


Gly 


Asp 


Leu 








265 






Gin 


Ala 


He 


Lys 


Leu 


Phe 








280, 






Gly 


Val 


Lys 


Leu 


Val 


Asn 








295 






Asp 


Val, His 


Leu 


Ser 


Met 








310 




Gin 


Asn 


Val 


Arg 


Trp 


Lys 








325 




• 


Lys 


Gin 


Leu 


Gin 


Asp 


Ala 








340 






His 


Phe 


Leu 


Ser 


Ser 


Ser 








355 






Ser 


Pro 


Asn 


Lys 


Val 


Pro 








370 






Thr 


Cys 


Trp Asp 


His 


Pro 








385 






Ala 


Asp 


Leu 


Asn 


Asn 


He 






400 






Lys 


Leu Arg 


Arg 


Val 


Gin 








415 






Leu 


Thr 


Thr 


Ala 


Leu 


Glu 








430 






Ser 


Glu 


His 


Val 


Met 


Asp 








445 






Ala 


Leu 


Tyr 


Glu 


Arg 


Leu 






460 






Val 


Pro 


Leu 


Cys 


Val 


Asp 








475 






Phe 


Asp 


Ser 


Gly 


Arg 


Ser 








490 






Thr 


Gly 


He 


Ala 


Cys 


Leu 






505 






Gin 


Tyr 


Leu 


Phe 


Ser 


Gin 






520 






Gin 


Arg 


His 


Leu 


Gly Val 








535 






Arg 


Gin Leu Gly 


Glu 


Val 








550 






Pro 


Ser 


Val 


Arg 


Ser 


Cys 








5€5 






He 


GlU 


Ala 


Ser 


Gin 


Phe 








580 






Ser 


Met 


Val 


Trp 


Leu 


Pro 








595 






Gin 


Val 


Lys 


His 


Gin 


Thr 






610 






His 


Gin Gly Val 


Gin 


Val 



625 



PCT/US01/06059 







T.50 


He 


Tyr 


Ser 






165 


Pro 


Phe 


Glu 






180 


Ser 


Pro 


Lys 






195 


Gin 


Ala 


Thr 






210 


Cys 


Val 


Asp 






225 


Leu 


Glu 


He 






240 


Gin 


Ala 


Glu 






255 


Phe 


He 


Asp 






270 


Lys 


Glu 


Glu 






285 


Asp 


Leu 


Ala 






300 


GlU 


Asn 


Ser 






315 


Leu 


Gin 


Ala 






330 


His 


Arg 


Asp 






345 


Val 


Gin 


Val 






360 


Tyr 


Tyr 


He 






375 


Lys 


Met 


Thr 






390 


Lys 


Phe 


Ser 






405 


Lys 


Ala 


Leu 






420 


He 


Phe 


Asn 






435 


Val 


Val 


Glu 






450 


Glu 


Glu 


GlU 






465 


Met 


Ser 


Leu 






480 


Gly 


Lys 


Met 






495 


Cys 


Gly Thr 






510 


Val 


Ala 


Asn 






525 


Leu 


Leu 


His 






540 


Ala 


Ala 


Phe 






555 


Phe 


Arg 


Phe 






570 


Leu 


GlU 


Trp 






585 


Val 


Leu 


His 






600 


Lys 


Cys 


Ser 






615 


Pro 


Glu 


Ser 






630 
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<210> 369 
<211> 433 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> incyte ID No: LI : 410742 . 1 .orf 3 :2000MAY01 
<400> 369 



Gin 


He 


Trp 


Met 


Lys 


Ser 


Thr 


Gin 


He 


He 


Gly 


Leu 


Gin Tyr He 


1 








5 










10 






15 


Ser 


Leu 


Pro 


Leu 


Gly 


Met 


Lys 


Glu 


He 


Gly 


Leu 


Lys 


Tyr Lys Arg 










20 










25 






30 


Asp 


Pro 


Arg 


Thr 


Asn 


Glu 


Gly 


He 


Leu 


Lys 


Val 


Val 


Lys Ala Leu 










35 










40 






45 


Asp 


Tyr 


Glu 


Gin 


Leu 


Gin 


Ser 


Val 


Lys 


Leu 


Ser 


He 


Ala Val Lys 








50 










55 






60 


Asn 


Lys 


Ala 


Glu 


Phe 


His 


Gin 


Ser 


Val 


He 


Ser 


Arg 


Tyr Arg Val 










65 










' 70 






75 


Gin 


Ser 


Thr 


Pro 


Val 


Thr 


He 


Gin 


Val 


He 


Asn 


Val 


Arg Glu Gly 










80 










85 






90 


He 


Ala 


Phe 


Arg 


Pro 


Ala 


Ser 


Lys 


Thr 


Phe 


Thr 


Val 


Gin Lys Gly 










95 








100 






105 


He 


Ser 


Ser 


Lys 


Lys 


Leu 


Val 


Asp 


Tyr 


He 


Leu 


Gly 


Thr Tyr Gin 










110 










115 






120 


Ala 


He 


Asp 


Glu 


Asp 


Thr 


Asn 


Lys 


Ala 


Ala 


Ser 


Asn 


Val Lys Tyr 










125 










130 






135 


Val 


Met 


Gly 


Arg 


Asn 


Asp 


Gly 


Gly 


Tyr Leu 


Met 


He 


Asp Ser Lys 










140 










145 






150 


Thr 


Ala 


Glu 


He 


Lys 


Phe 


Val 


Lys 


Asn 


Met 


Asn 


Arg 


Asp Ser Thr 










155 








160 






165 


Phe 


He 


Val 


Asn 


Lys 


Thr 


He 


Thr 


Ala 


Glu 


Val 


Leu 


Ala He Asp 










170 










175 






180 


Glu 


Tyr 


Thr 


Gly 


Lys 


Thr 


Ser 


Thr 


Gly Thr 


Val 


Tyr 


Val Arg Val 










185 










190 






195 


Pro 


Asp 


Phe 


Asn 


Asp 


Asn 


Cys 


Pro 


Thr 


Ala 


Val 


Leu 


Glu Lys Asp 










200 










205 






210 


Ala 


Val 


Cys 


Ser 


Ser 


Ser 


Pro 


Ser 


Val 


Val 


Val 


Ser 


Ala Arg Thr 










215 










220 






225 


Leu 


Asn 


Asn 


Arg 


Tyr 


Thr 


Gly 


Pro 


Tyr Thr 


Phe 


Ala 


Leu Glu Asp 










230 










235 






240 


Gin 


Pro 


Val 


Lys 


Leu 


Pro 


Ala 


Val 


Trp 


Ser 


He 


Thr 


Thr Leu Asn 








245 








250 






255 


Ala 


Thr 


Ser 


Ala 


Leu 


Leu 


Arg 


Ala 


Gin 


Glu 


Gin 


He 


Pro Pro Gly 










260 








265 






270 


Val 


Tyr 


His 


He 


ser 


Leu 


Val 


Leu 


Thr Asp 


Ser 


Gin 


Asn Asn Arg 










275 










280 






285 


Cys 


Glu 


Met 


Pro 


Arg 


Ser 


Leu 


Thr 


Leu 


Glu 


vai 


cys 


Gin Cys Asp 










290 










295 






300 


Asn 


Arg 


Gly 


He 


Cys 


Gly 


Thr 


Ser 


Tyr 


Pro 


Thr 


Thr 


Ser Pro Gly 










305 










310 






315 


Thr 


Arg 


Tyr 


Gly 


Arg 


Pro 


His 


Ser 


Gly Arg 


Leu 


Gly 


Pro Ala Ala 










320 










325 






330 


He 


Gly 


Leu 


Leu 


Leu 


Leu 


Gly 


Leu 


Leu 


Leu 


Leu 


Leu 


Leu Ala Pro 










335 








340 






345 


Leu 


Leu 


Leu 


Leu 


Thr 


Cys 


Asp 


Cys 


Gly Ala 


Gly 


Ser 


Thr Gly Gly 










350 










355 






360 


Val 


Thr 


Gly 


Gly 


Phe 


He 


Pro 


Val 


Pro Asp 


Gly 


Ser 


Glu Gly Thr 










365 










370 






375 


He 


His 


Gin 


Trp 


Gly 


He 


Glu 


Gly 


Ala 


His 


Pro 


Glu 


Asp Lys Glu 










380 








385 






390 


He 


Thr 


Asn 


He 


Cys 


Val 


Pro 


Pro 


Val 


Thr 


Ala 


Asn 


Gly Ala Asp 










395 










400 






405 


Phe 


Met 


Glu 


Ser 


Ser 


Glu 


Val 


Cys 


Thr 


Asn 


Thr 


Tyr 


Ala Arg Gly 










410 








415 






420 


Thr 


Ala 


Val 


Glu 


Gly 


Thr 


Ser 


Gly 


Asn Gly 


Asn 


Asp 


His 



» 
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i 

425 ■ 430 

<210> 370 
<211> 531 
<212> PRT 

<213> Homo 1 sapiens 
<220> 

<221> misc_f eatiire 

<223> Incyte ID No: LG:406568 . 1 .orf 3 :2000MAY19 
<220> 

<221> unsure 

<222> 148, 291 

<223> unknown or other 

<400> 370 



Ala 


Cys 


His 


Leu 


Pro 


Pro 


Cys 


Leu 


Leu 


Leu 


Ala 


Ala Leu 


Asn 


Ala 


1 






5 








10 , 








15 


Trp 


Ser 


Phe 


Lys 


Leu 


Leu 


Leu 


Gly 


Leu 


Thr 


Lys 


Gin Gly 


Pro 


Cys 








20 










25 








3 P 


Leu 


Pro 


Leu 


Ala 


Thr 


Glu 


GlU 


Asp, Ser Val 


Asn 


Thr Asn 


Pro 


Ser 










35 










40 




dlu Glu 




45 


Thr 


Glu 


Asp 


Glu 


Leu 


Leu 


Ala 


Ser 


Leu 


Ser 


Ala 


Leu 


Lys 








50 










55 








60 


Glu 


Leu 


Glu 


Arg 


Glu 


Leu 


GlU 


Asp 


He 


Glu 


Pro 


Asp Arg 


Asn 


Leu 








65 










70 








75 


Pro 


Val 


Gly 


Leu 


Arg 


Gin 


Lys 


Ser 


Leu 


Thr 


Glu 


Lys Thr 


Pro 


Thr 








80 








85 








90 


Gly 


Thr 


Phe 


Ser 


Arg 


Glu 


Ala 


Leu 


Met 


Ala 


Tyr 


Trp Glu Lys 


Glu 








95 










100 








105 


Ser 


Gin 


Lys 


Leu 


Leu 


Glu 


Lys 


Glu Arg Leu Gly 


Glu Cys 


Gly Lys 








110 










115 








120 


Val 


Ala 


Glu 


Asp 


Lys 


Glu 


GlU 


Ser 


Glu 


GlU 


Glu 


Leu He 


Phe 


Thr 








125 










130 








135 


Glu 


Ser 


Asn 


Ser, 


Glu 


Val 


Ser 


Glu 


Glu 


Val 


Tyr 


Thr Xaa 


Glu 


GlU 










140 










145 






150 


Glu 


Glu 


Glu 


Ser 


Gin 


Glu 


Glu 


Glu 


Glu 


Glu 


Glu 


Asp Ser Asp 


Glu 










155 










160 








165 


Glu 


*Glu 


Arg 


Thr 


He 


Glu 


Thr 


Ala 


Lys 


Gly 


lie 


Asn Gly Thr 


Val 








170 








175 








180 


Asn 


Tyr 


Asp 


Ser 


Val 


Asn 


Ser 


Asp 


Asn 


Ser 


Lys 


Pro Lys 


He 


Phe 






185 










190 








195 


Lys 


Ser 


Gin 


He 


Glu 


Asn 


He 


Asn 


Leu 


Thr 


Asn 


Gly Ser Asn 


Gly 








200 










205 








210 


Arg 


Asn 


Thr 


Glu 


Ser 


Pro 


Ala 


Ala 


He 


His 


Pro 


Cys Gly Asn 


Pro 








215 










220 








225 


Thr 


Val 


He 


Glu 


Asp 


Ala 


Leu 


Asp Lys 


He 


Lys 


Ser Asn Asp 


Pro 










230 










235 








240 


Asp 


Thr 


Thr 


Glu 


Val 


Asn 


Leu 


Asn 


Asn 


He 


Glu 


Asn He 


Thr 


Thr 








245 










250 








255 


Gin 


Thr 


Leu 


Thr 


Arg 


Phe 


Ala 


Glu 


Ala 


Leu 


Lys 


Asp Asn 


Thr 


Val 










260 










265 






270 


Val 


Lys 


Thr 


Phe 


Ser 


Leu 


Ala 


Asn 


Thr 


His 


Ala 


Asp Asp 


Ser 


Ala 








275 










280 








285 


Ala 


Met 


Ala 


He 


Ala 
,290 


Xaa 


Met 


Leu 


Lys 


Val 
295 


Asn 


Glu His 


He 


Thr 
300 


Asn 


Val 


Asn 


Val 


Glu 


Ser 


Asn 


Phe 


He 


Thr 


Gly 


Lys Gly 


He 


Leu 










305 










310 




315 


Ala 


lie 


Met 


Arg 


Ala 


Leu 


Gin 


His 


Asn 


Thr 


Val 


Leu Thr 


Glu 


Leu 








320 










325 








330 


Arg 


Phe 


His 


Asn 


Gin 


Arg 


His 


He 


Met 


Gly 


Ser 


Gin Val 


Glu 


Met 








335 








340 








345 


Glu 


He 


Val 


Lys 


Leu 


Leu 


Lys 


Glu 


Asn 


Thr 


Thr 


Leu Leu 


Arg 


Leu 










350 








355 








360 


Gly 


Tyr 


His 


Phe 


Glu 


Leu 


Pro 


Gly Pro Arg Met 


Ser Met 


Thr 


Ser 






365 










370 








375 
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lie Leu Thr Arg^sn Met Asp Lys Gin Arg Gin Lys^rg Leu Gin 

380 "* 385 390 

Glu Gin Lys Gin Gin Glu Gly Tyr Asp Gly Gly Pro Asn Leu Arg 

395 400 „ 405' 

Thr Lys Val Trp Gin Arg Gly Thr Pro Ser Ser Ser Pro Tyr Val 

410 1 415 420 

Ser Pro Arg His Ser Pro Trp Ser Ser Pro Lys Leu Pro Lys Lys 

425 430 435 

Val Gin Thr Val Arg Ser Arg Pro Leu Ser Pro Val Ala Thr Pro 

440 445 450 

Pro Pro Pro Arg Asp Ser Ser Thr Pro Arg Glu Lys Ala His Tyr 

455 460 465 

Gin Lys His Cys Arg Ser His Gin Thr Thr Gly Glu Cys Pro Thr 

470 475 1 480 

Gly lie Thr Lys Trp Thr Lys Lys Glu Lys Arg Glu Lys Gly Gin 

485 490 495 
Glu Thr Ala Lys Gin Tyr Ser Lys Gly Asn Lys Lys Phe Ser Glu 

500 -505 510 

Val Ser Ala Arg Glu Glu Asn Gly Arg Gin Phe Pro Thr Phe Tyr 

515 520 525 

Pro Thr Glu lie Ser Ser 
530 

<210> 371 
<211> 257 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> mis cofeature 

,<223> Incyte ID No: LI : 283762 . 1 .orf 2 :2000MAY01 
<400> 371 

Lys Ala Phe Phe lie Trp Trp Gin Ser Cys Trp Asn Arg Gly Asn 
15 10 15 

Gin Ser Phe Lys Phe Leu Glu Gin lie Leu Trp Ser Asn Leu Gin 
20 25 30 

lie Leu Lys Lys Thr His His Pro Thr His Arg Arg Tyr Asp Phe 
35 40 45 

Phe Val Ser Arg Phe Ser Ala Met Cys His Ser Cys His Ser Asp 
50 55 60 

Pro Glu lie Arg Thr Glu lie Arg lie Ala Gly lie Arg Gly lie 
65 70 75 

Gin Gly Val Val Arg Lys Thr Val Asn Asp Glu Leu Arg Ala Thr 
80 85 90 

lie Trp Glu Pro Gin His Met Asp Lys He Val Pro Ser Leu Leu 
95 100 105 

Phe Asn Met Gin Lys He Glu Glu Val Asp Ser Arg He Gly Pro 
110 115 ' 120 

Pro Ser Ser Pro Ser Ala Thr Asp Lys Glu Glu Asn Pro Ala Val 
125 130 135 

Leu Ala Glu Asn Cys Phe Arg Glu Leu Leu Gly Arg Ala Thr Phe 
140 145 150 

Gly Asn Met Asn Asn Ala Val Arg Pro Val Phe Ala His Leu Asp 
155 160 165 

His His Lys Leu Trp Asp Pro Asn Glu Phe Ala Val His Cys Phe 
170 175 180 

Lys He He Met Tyr Ser He Gin Ala Gin Tyr Ser His His Val 
185 190 . 195 

He Gin Glu He Leu Gly His Leu Asp Ala Arg Lys Lys Asp Ala 
200 205 210 

Pro Arg Val Arg Ala Gly He He Gin Val Leu Leu Glu Ala Val 
215 220 225 

Ala He Ala Ala Lys Gly Ser He Gly Pro Thr Val Leu Glu Val 
230 ~ 235 240 

Phe Asn Thr Leu Leu Lys His Leu Arg Leu Ser Val Glu Phe Glu 
245 250 255 
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Ser Lys 

<210> 372 
<211> 242 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LI : 347687 . 113 . orf 1 : 2 000MAY01 
<400> 372 

Gin Pro Cys Gly Phe Gin Gly Ala Glu Asn Arg Arg Lys Leu Ala 

1 5 10 15 

Tyr Met Arg Thr Asp Trp Pro Glu Glu Gin Gin Leu Leu Val Ala 

20 25 30 

Leu Phe Cys Gly Cys Gly His Glu Ala Leu Thr Gly Thr Glu Lys 

35 40 45 

Leu lie Glu Thr Tyr Phe Ser Lys Asn Tyr Gin Asp Tyr Glu Tyr 

50 55 60 

Leu lie Asn Val lie His Ala Phe Gin Tyr Val lie Tyr Gly Thr 

•65 70 75 

Ala Ser Phe Phe Phe Leu Tyr Gly Ala Leu Leu Leu Ala Glu Gly 

80 ~ 85 90 

Phe Tyr Thr Thr Gly Ala Val Arg Gin. lie Phe Gly Asp Tyr Lys 

95 100 105 

Thr Thr He Cys Gly Lys Gly Leu Ser Ala Thr Phe Val Gly He 

110 115 120 

Thr Tyr Ala Leu Thr Val Val Trp Leu Leu Val Phe Ala Cys Ser 

125 130 135 

Ala Val Pro Val Tyr He Tyr Phe Asn Thr Trp Thr Thr Cys Gin 

140 145 150 

Ser He Ala Phe Pro Ser Lys Thr Ser Ala Ser He Gly Ser Leu 

155 160 165 

Cys Ala Asp Ala Arg Met Tyr Gly Val Leu Pro Trp Asn Ala Phe 

170 175 180 

Pro Gly Lys Val Cys Gly Ser Asn Leu Leu Ser* He Cys Lys Thr 

185 190 195 

Ala Glu Phe Gin Met Thr Phe His Leu Phe He Ala Ala Phe Val 

200 205 210 

Gly Ala Ala Ala Thr Leu Val Ser Leu Leu Thr Phe Met He Ala 

215 220 225 
Ala Thr Tyr Asn Phe Ala Val Leu Lys Leu Met Gly Arg Gly Thr 

230 235 240 

Lys Phe 

<210> 373 

<211> 60 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 1146510 . 1 . orf 2 : 20 00MAY01 
<400> 373 

Thr Glu Leu Gin Arg Pro Arg Ser Ala Ser He Tyr Ser Arg Tyr 

15 10 15 
Ala Ser Ser Asn Phe Arg Arg Cys Gly Val Glu Leu Lys Tyr Ser 

20 25 30 
Ser Lys Asn Arg Glu Met Thr Asp Thr Thr Asp Ala Val His Asn 

35 40 45 
Cys Asn Ala Arg Tyr Ser Ser Pro Arg Pro Lys Leu Leu Glu Asp 

50 55 60 
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<210> 374 
<211> 157 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> incyte ID No: LG: 451710 . 1 .orfl :2000FEB18 
<400> 374 



Pro 






Asn 


T">-v*^-\ Jl 1 q T5v/^ Dva Dvn 

JrxO Ala iriO iriO iriO riO 


Arg Val His 


Pro 


X 






c 

D 


1 n 




15 


Ser 


feer 


irp Arg 


Axa 


Pro He Gin Glu Met Ala 


Val 1 Pro Leu 


Leu 








ZJ 




30 


Thr 


Lys 


Lys lie 


Val 


Lys Lys Arg Val Lys Gin 


Phe Lys Arg 


Pro 








35 


. 40 




45 


His 


Leu 


Asp Arg 


Tyr 


Lys Cys Leu Lys Pro Ser 


Trp Arg Arg 


Pro 








50 


55 




60 


Lys 


Gly 


lie Asp 


Ser 


Arg Val Arg Arg Lys Phe 


Lys Gly Cys 


Thr 








65 


70 




75 


Leu 


Met 


Pro Asn 


He 


Gly Tyr Gly Ser Asp .Lys 


Ser Thr Arg His 








80 


85 




90 


Tyr 


Leu 


Pro Asn 


Lys 


Phe Lys Lys . Phe Val Val 


His Asn Val 


Ser 








95 


100 




105 


Glu 


Leu 


Glu Leu 


Leu 


Met Met His Asn Arg Thr 


T^yr Cys Ala 


Glu 








110 


' H5 




120 


lie 


Ala 


His Asn 


Val 


Ser Thr Lys Lys Arg Lys 


Glu lie Val 


Glu 








125 


130 




135 


Arg 


Ala 


Ala Gin 


Leu 


Asp He Val Val Thr Asn 


Lys Leu Ala Arg 








140 


145 




150 


Leu 


Arg 


Ser Gin 


Glu 


Asp Glu 












155 







<210> 375 
<211> 158 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 455771 . 1 . orf 3 : 2000FEB18 
<400> 375 



Ala 


Ser 


Cys Ser Arg 


Arg Arg Glu Ala 


Leu 


Gin 


Arg Thr Ser 


Val 


1 




5 




10 






15 


Asn 


Met 


Gly Lys Thr 


Arg Gly Met Gly 


Ala 


Gly 


Arg Lys Leu 


Lys 






20 




25 






30 


Thr 


His 


Arg Arg Asn 


Gin Arg Trp Ala 


Asp 


Lys 


Ala Tyr Lys 


Lys 






35 




40 






45 


Ser 


His 


Leu Gly Asn 


Glu Trp Lys Lys 


Pro 


Phe 


Ala Gly Ser 


Ser 






50 




55 






60 


His 


Ala 


Lys Gly He 


Val Leu Glu Lys 


He 


Gly 


He Glu Ala 


Lys 






65 




70 






75 


Gin 


Pro 


Asn Ser Ala 


He Arg Lys Cys 


Ala 


Arg 


Val Gin Leu 


Val 






80 


85 




90 


Lys 


Asn 


Gly Lys Lys 


He Ala Ala Phe 


Val 


Pro 


Asn Asp Gly Cys 






95 




100 






105 


Leu 


Asn 


Tyr He Glu 


Glu Asn Asp Glu 


Val 


Leu 


He Ala Gly 


Phe 






110 


115 






120 


Gly 


Arg 


Lys Gly His 


Ala Val Gly Asp 


He 


Pro 


Gly Val Arg 


Phe 






125 




130 






135 


Lys 


Val 


Val Lys Val 


Ser Gly Val Ser 


Leu 


Leu 


Ala Leu Phe 


Lys 






140 


145 






150 


Glu 


Lys 


Lys Glu Lys 


Pro Arg Ser 














155 











<210> 376 
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<211> 238 
<212> PRT 

<213> Homo sapiens ' 
<220> 

<221> mis cofeature 

<223> Incyte. ID No: LG:452089 . 1 .orf 2 :2000FEB18 
<400> 376 



Leu 


Gly 


Lys 


His 


Arg 


Arg 


Pro Pro Pro Pro 


Lys 


Asp Gly Arg Arg 


1 








5 




10 




15 


Arg 


Gly 


Tyr 


Gly 


Arg 


Leu 


Leu Ala Pro Met 


Ala 


His Glu Lys Lys 






20 




25 




30 


Leu 


Ser 


Asn 


Pro 


Met 


Arg Glu He Lys ,Val 


Gin 


Lys Leu Val Leu 










35 




40 




45 


Asn 


He 


Ser 


Val 


Gly 


Glu 


Ser Gly Asp Arg 


Leu 


Thr Arg Ala Ala 










50 




55 




60 


Lys 


Val 


Leu 


GlU 


Gin 


Leu 


Ser Gly Gin Thr 


Pro 


Val Phe Ser Lys 








65 




70 , 




75 


Ala 


Arg 


Tyr 


Thr 


Val 


Arg 


Ser Phe Gly He 


Arg 


Arg Asn Glu Lys 








80 




85 




9,0 


He 


Ala 


Cys 


Tyr 


Val 


Thr 


Val Arg, Gly Glu 


Lys 


Ala Met Gin Leu 






95 




100 




105 


Leu 


Glu 


Ser 


Gly 


Leu 


Lys 


Val Lys Glu Tyr 


Glu 


Leu Leu Arg Arg 








110 


115 




120 


Asn 


Phe 


Ser 


Asp 


Thr 


Gly Cys Phe Gly Phe Gly 


lie Gin Glu His 








125 




130 




135 


He 


Asp 


Leu 


Gly 


He 


Lys 


Tyr Asp Pro Ser 


Thr 


Gly He Tyr Gly 






140 




145 




150 


Met 


Asp 


Phe 


Tyr 


Val 


Val 


Leu Glu Arg Ala 


Gly 


Tyr Arg Val Ala 






155 




160 




165 


Arg 


Arg 


Arg 


Arg 


Cys 


Lys 


Ser Arg Val Gly 


He 


Gin His Arg Val 








170 




175 




180 


Thr 


Lys 


Glu 


Asp 


Ser 


Met 


Lys Trp Phe Gin 


Val 


Lys Tyr Glu Gly 






185 




190 




195 


Val 


He 


Leu 


Lys, 


Gin 


Gly Ser Gly Leu His Val 


Pro Pro Leu Thr 








200 




205 




210 


Cys 


Gly 


Gin 


Asn 


Ser 


Ser 


Leu Val Ser Ser 


Pro 


Pro Pro Cys Gin 






215 




220 




225 


Arg 


Lys 


Thr 


Thr 


Thr 


His 


Leu Ala Arg Leu 


Phe 


Trp Val 






230 




235 







<210> 377 
<211> 102 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG : 246415 . 1 . orf 3 : 2000FEB18 
<400> 377 



Leu 


Pro 


Pro 


Val Arg 


Ala 


Ser 


Asn Met 


Met 


Lys Lys 


Arg Arg Asn 


1 






5 








10 




15 


Asn 


Gly Arg 


Thr Lys 


Lys 


Gly 


Arg Gly 


His 


Val Gin 


Pro He Cys 








20 








25 




30 


Asp 


Thr 


Asn 


Cys Ala 


Gin 


Cys 


Val Pro 


Lys 


Asp Lys 


Ala He Asn 






35 








40 




45 


Lys 


Phe 


He 


He Gly 


Asn 


Thr 


Val Glu 


Ala 


Ala Ala Val Arg Asp 






50 








55 




60 


He 


Ser 


Glu 


Ala Ser 


Val 


Phe 


Asp Ala 


Tyr 


Val Leu 


Pro Lys Leu 








65 






70 




75 


Tyr 


Leu 


Lys 


Leu His 


Tyr 


Cys 


Leu Ser 


Cys 


Ala He 


His Ser Arg 








80 






85 




90 


Val 


Val 


Arg 


Asn Arg 


Ser 


Cys 


Glu Ala 


His 


Lys Asp 










95 






100 
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<210> 378 
<211> 102 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LG:414144 . 10 .orf 1 :2000FEB18 
<400> 378 

Val Tyr Trp Cys Arg Glu Leu He Leu Lys Arg Gly Gin Ala Lys 

15 10 15 

Val Lys Asn Lys Thr He Pro Leu Thr Asp Asn Thr Val He Glu 

20 25 ' 30 

Glu His Leu Gly Lys Phe Gly Val He Cys Leu Glu Asp Leu He 

35 40 45 

His Glu He Ala Phe Pro Gly Lys His Phe Gin Glu He Ser Trp 

50 55 60 

Phe Leu Cys Pro Phe His Leu Ser Val Ala Arg His Ala Thr Lys 

65 70 75 

Asn Arg Val Gly Phe Leu Lys Glu Met Gly Thr Pro Gly Tyr Arg 

80 85 90 

Gly Glu Arg He Asn Gin Leu lie Arg Gin Leu Asn.. 

95 .100 

<210> 379 
<211> 177 
<212> PRT 
<213> Homo sapiens 

,<220> 

<221> misc_feature 

<223> Incyte ID No: LG : 1101445 . 1 . orf 3 : 2000FEB18 
<400> 379 

Gly Thr Met Glu Ala Val Pro Glu Lys Lys Lys Lys Val Ala Ala 

15 10 15 

Ala Pro Gly Thr Leu Lys Lys Lys Lys Val Pro Ala Val Pro Glu 

20 25 30 

Thr Leu Lys Lys Lys Arg Arg Asn Phe Ala Glu Leu Lys Val Lys 

35 40 45 

Arg Leu Arg Lys Lys Phe Ala Leu Lys Thr Leu Arg Lys Ala Arg 

50 55 60 

Arg Lys Leu He Tyr Glu Lys Ala Lys His Tyr His Lys Glu Tyr 

65 ~ 70 75 

Arg Gin Met Tyr Arg Thr Glu He Arg Met Ala Arg Met Ala Arg 

80 85 • 90 

Lys Ala Gly Asn Phe Tyr Val Pro Ala Glu Pro Lys Leu Ala Phe 

95 100 1 105 
Val He Arg He Arg Gly He Asn Gly Val Ser Pro Lys Val Arg 

110 115 120 
Lys Val Leu Gin Leu Leu Arg Leu Arg Gin He Phe Asn Gly Thr 

125 130 135 
Phe Val Lys Leu Asn Lys Ala Ser Val Asn Met Leu Arg He Val 

140 145 150 
Glu Pro Tyr He Ala Trp Gly Val Pro Gin Pro Glu Val Ser Lys 

155 ■ 160 165 

Arg Ala His Leu Gin Thr Arg Leu Trp Gin Asn Gin 

170 175 

<210> 380 

<211> 86 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 
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WO 01/62927 
<223> Incyte ID No: 
<400> 380 

Leu Pro Ser Tyr Ala 
1 5 
Ser His Ser Glu Thr 
20 

Glu Arg Pro Ala Arg 
35 

Tyr Arg Gin lie Ala 
50 

Ala Glu Ala Gly Ala 
65 

Arg Phe. Gly Cys Gly 
80 

<210> 381 
<211> 97 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_f eature 
<223> Incyte ID No: LI : 903021 . 1 . orfl : 2000FEB01 

<400> 381 



Arg 


Asn 


Phe 


Thr 


GlU 


Leu 


Lys 


He Lys 


Arg 


Leu Arg Asn Lys 


Phe 


1 








5 






10 




15 


Ala 


Gin 


Lys 


Met 


Leu 


Leu 


Lys 


Ala Arg 


Arg 


Lys Leu He Tyr 


Glu 








20 






25 




30 


Lys 


Ala 


Lys 


His 


Tyr 
35 


His 


Lys 


Glu Tyr 


Met 
40 


Gin Met Tyr Arg 


Thr 
45 


Glu 


He 


Gin 


He 


Ser 


Arg 


He 


Ala Arg 


Lys 


Ala Gly Asn Phe 


Tyr 










50 






55 




60 


Val 


Ser 


Ala 


Glu 


Pro 


Lys 


Leu 


Ala Phe 


Val 


He Arg He Gly 


Gly 










65 






70 




75 


Tyr 


Gin 


Leu 


Gly 


Glu 


Pro 


Lys 


Gly Leu 


Lys 


Gly Val Ala Thr 


Ser 








80 






85 




90 


Leu 


Pro 


Ser 


Ser 


Asn 
95 


Leu 


Gin 











L^^ 



PCT/US01/060S9 

. t 

"52134 . 1 . orf 2 :2000FEB18 



Pro Glu He yal 


Pro 


ax a inr ueu Arg 


Arg 


1U 






Gly Arg Pro Gin 


Pro 


jjys <jiy Lieu Lys 


Gly 


25 




30 


Leu Thr Arg Gly 


Glu 


Ala Asp Arg Asp 


Thr 




40 




45 


Val Pro Pro Asp 


Ala 


Asp Arg Lys Ala 


Glu 


55 




60 


Gly Ser Glu Thr 


Glu 


Phe Gin Phe Arg 


Gly 


70 




75 


Gly Gly Gin Pro 


Pro 


Gin 




85 







<210> 382 

<211> 82 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_f eature 

<223> Incyte ID No: LI : 246422 . 1 . orfl : 2000FEB01 



<400> 382 



Lys 


Lys 


Arg 


Lys 


Gin 


Val 


Pro 


1 








5 






Pro 


Val 


Glu 


Asp 


Gly 
20 


He 


Met 


Phe 


Gin 


Glu 


Arg 


He 
35 


Lys 


Met 


Gly Gly 


Val 


Val 


Thr 


His 


Arg 










50 






Asp 


He 


Gin 


Ala 


Ala 


Leu 


Phe 








65 






Gin 


Lys 


He 


Ser 


Glu 
80 


Glu 


Glu 



Lys 


Phe 


Thr 


Leu Asp Arg Thr 


His 




10 






15 


Asp 


Ala 


Ala 


Asn Phe Glu 


Gin 


Phe 




25 






30 


Asn 


Gly 


Lys 


Ala Gly Asn 


Phe 


Gly 






40 






45 


Arg Glu 


Gin 


Glu Gin Asp 


Gin 


Arg 






55 






60 


Gin 


Gin 


Val 


Phe Glu He 


Ser 


His 






70 






75 



<210> 383 
<211> 180 
<212> PRT 
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PCT/US01/06059 



<220> 

<221> misc_feature , 

<223> incyte ID No: LG : 449404 . 1 .orfl : 2000MAY19 
<400> 383 



Cor 




Fro 


Lys Ser 


Cys 


Leu 


Arg t>er v»iy nis 


fZO ici b€U X11S 


1 






i 1 c 




in 

1 v 




Aia 


Tnr 


wec 


Ser .Arg 


Arg 


Lys 


mr Arg v^iu Fro 


T r ft f * ] it /■ i T « « T\ t»l*l 

jjys biu Liiu Asn 














O U 


vai 


Tnr 


Leu 


vjiy Fro 


Tnr 


vai 


Arg vjiu vaiy i^iu 


iyx vai jtrne v^iy 


















vai 


T\ "1 -» 

Ala 


HIS 


i l e Fne 


Ala 


Ser 


pne Asn Asp mr 


Fne lie His lie 








en 








ou 


Tnr 


Asp 


Leu 


Ser Gly 


TV -vr*r 

Arg 


G1U 


Tnr Leu ■ vai Arg 


lie inr t»iy t»iy 








65 






70 


75 


Met 


Lys 


Vai 


Lys Ala 


Asp 


Arg 


Asp Glu Ser Ser 


Pro Tyr Ala Ala 








80 






85 


90 


Met 


Leu 


Ala 


Ala Gin 


Asp 


Vai 


Ala Gin Arg Cys 


Lys Glu Leu Gly 








95 






100 • 


105 


lie 


Thr 


Ala 


Leu His 


He 


Lys 


Leu Arg Ala Thr 


Gly Gly Asn Lys 








110 




. 115 


120 


Thr 


Lys 


Thr 


Pro Gly 


Pro 


Gly 


Ala Gin Ser Ala 


Leu Arg Ala Leu 








125 




130 


135 


Ala 


Arg 


Ser 


Gly Met 


Lys 


He 


Gly Arg He Glu 


Asp Vai Thr Pro 








140 






145 


150 


Vai 


Pro 


Thr 


Asp Ser 


Thr 


Arg 


Arg Lys Gly Gly 


Arg Arg Gly Lys 








155 






160 


165 


Glu 


Asp 


Cys 


Arg Arg 


His 


His 


Tyr Cys Vai Pro 


Phe Ala Gly Ser 








170 






175 


180 



<210> 384 
<211> 118 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 449413 . 1 .orf 3 :2000MAY19 
<400> 384 



Pro 


Arg 


Cys 


Phe Arg 


Leu 


Pro Gin Arg 


Arg 


Arg 


Pro Ser Gin Pro 


1 






5 






10 




15 


Vai 


Pro 


Ser 


Ser Ala 


Thr 


Met Gly Lys 


Thr 


Arg 


Gly Met Gly Ala 








20 






25 




30 


Gly 


Arg 


Lys 


Leu Lys 


Thr 


His Arg Arg 


Asn 


Gin 


Arg Trp Ala Asp 








35 






40 




45 


Lys 


Ala 


Tyr 


Lys Lys 


Ser 


His Leu Gly 


Asn 


Glu 


Trp Lys Lys Pro 








50 






55 




60 


Phe 


Ala 


Gly 


Ser Ser 


His 


Ala Lys Gly 


He 


Vai 


Leu Glu Lys lie 








65 






70 




75 


Gly 


He 


Glu 


Ala Lys 


Gin 


Pro Asn Ser 


Ala 


lie 


Arg Lys Cys Ala 








80 






85 




90 


Arg 


Vai 


Gin 


Leu Vai 


Lys 


Asn Gly Lys 


Lys 


He 


Ala Ala Phe Vai 








95 






100 




105 


Pro 


Asn 


Asp 


Gly Cys 


Leu Asn Tyr He 


Glu 


Glu 


Asn Vai 








110 






115 







<210> 385 
<211> 164 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_Ji eature 
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<223> Incyte ID No: LG^50105 . 1 . orf 2 : 2000MAY19 



<400> 385 

Pro Ser Ala Ala Arg Pro Pro Tyx Ser Arg Tyr Arg Ala Arg Arg 
1 5 10 15 

His His Leu Arg Arg Ser Asn Met Gly Lys Thr Arg Gly Met Gly 
20 25 30 

Ala Gly Arg Lys Leu Lys Thr His Arg Arg Asn Gin Arg Trp Ala 
35 40 45 

Asp Lys Ala Tyr Lys Lys Ser His Leu Gly Asn Glu Trp Lys Lys 
50 55 60 

Pro Phe Ala Gly Ser Ser His Ala Lys Gly lie Val Leu Glu Lys 
65 70 75 

lie Gly lie Glu Ala Lys Gin Pro Asn Ser Ala He Arg Lys Cys 
80 85 90 

Ala Arg Val Gin Leu Val Lys Asn Gly Lys Lys He Ala Ala Phe 
95 100 105 

Val Pro Asn Asp Gly Cys Leu Asn Tyr He Glu Glu Asn Asp Glu 
110 . 115 120 

Val Leu He Ala Gly Phe Gly Arg Lys Gly His Ala Val Gly Asp 
125 130 135 

He Pro Gly Val Arg Phe Lys Val Val Lys Val Ser Gly Val Ser 
140 145 150 

Leu Leu Ala Leu Phe Lys Glu Lys Lys Glu Lys Pro Arg Ser 
155 160 

<210> 386 
<211> 101 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No : LG: 460809 . 1 . orf 3 :2000MAY19 
<400> 386 

Ala Trp Val Glu Trp Ala Ser Arg Ser Ala Pro Arg Ala His Arg 

15 10 15 

Glu He Gin Lys Phe Ala Met Lys Glu Met Gly Thr Pro Asn Leu 

20 25 30 

His He Asp Val Arg Leu Asn Lys Ala Leu Trp Ala Lys Gly He 

35 40 45 

Arg Asn Val Pro Tyr His He His Met Lys Leu Pro Arg Lys Leu 

50 55 60 

Asn Glu Asp Glu Asp Ser Pro Asp Lys Leu Tyr Ala Leu Val Pro 

65 70 75 

Thr Tyr Thr Cys Tyr His Phe His Lys Ser He Asp- Arg Gin Cys 

80 85 90 
Gly Arg Glu Leu Thr Thr Asp Gly Ser He His 

95 100 

<210> 387 
<211> 259 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LG:481781 . 1 .orf 3 :2000MAY19 
<400> 387 

Arg Ser Val Arg Arg Arg Ser Ser Ser Ser Arg Arg Arg Arg Val 

1 5 10 15 

Ala Ala Pro His Leu Glu Leu Ala Thr Met Ala Arg Gly Leu Lys 

20 25 30 

Lys His Leu Lys Arg Leu Asn Ala Pro Lys His Trp Met Leu Asp 

35 40 45 
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Lys 


Leu 


Gly 


Gly* 


m 

IHa 


Phe Ala Pro Lys Pro Ser 


Ser^Wy Pro 


His 






50 


55 




60 


Lys 


Ser 


Arg 


Glu 


Cys 


Leu Pro Leu lie Leu lie 


He Arg Asn 


Arg 








65, 


70 




75 


Leu 


Lys 


Tyr 


Ala 


Leu 


Thr Tyr Arg Glu Val He 


Ser He Leu 


Met 










80 


85 




90 


Gin 


Arg 


His 


Val 


Leu 


Val Asp Gly Lys Val Arg 


Thr Asp Lys 


Thr 






i 




. 95 


100 




105 


Tyr 


Pro 


Ala 


Gly Phe 


Met Asp Val He' Ser He 


Pro Lys Thr 


Asn 








■ 110 


115 




120 


GlU 


Asn 


Tyr 


Ara 


Leu 


Leu Tyr Asp Thr Lys Gly 


Arg Phe Arg 


Leu 










125 


130 




135 


His 


Pro 


lie 


Arg 


Asp 


Glu Asp Ala Lys Phe Lys 


Leu 1 Cys Lys 


Val 










140 


145 


Tyr Leu Asn 


150 


Arg 


Ser 


Val 


Gin 


Phe 


Gly Gin Lys Gly He Pro 


Thr 










155 


160 




165 


Tyr 


Asp 


Gly 


Arg 


Thr 


He Arg Tyr Pro Asp Pro 


Leu lie Lys 


Ala 










170 


175 




180 


Asn 


Asp 


Thr 


tie 


Lys 


He Asp Leu Glu Thr Asn 


Lys He Val 


Asp 










185 


190 




195 


Phe 


lie 


Lys 


Phe 


Asp 


Val Gly Asn Val Val • Met 


Val Thr Gly 


Gly 








200 


205 




210 


Arg 


Asn 


Thr 


Gly 


Arg. 


Val Gly Val * He Lys Asn 


Arg Glu Lys 


His 










215 


220 




225 


Lys 


Gly 


Ser 


Phe 


Glu 


Thr He His Val Glu Asp 


Ser Trp Ala 


Thr 








230 


235 




240 


Gly 


Ser 


Pro 


Pro 


Val 


Trp Ala Thr Cys Ser Pro 


Ser Ala Arg 


Val 










245 


250 




255 


lie 


Ser 


Arg 


Gly 
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<210> 388 
<211> 184 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_f eature 

<223> Incyte ID No : LG: 1101153 . 1 .orf 2 :2000MAY19 
<400> 388 



Arg 


Arg Arg Pro Phe 


Leu Leu Arg Ser 


Phe 


Ala Ala Ala Met 


Val 


1 


5 




10 




15 


Lys 


Tyr ser Gin Glu 


Pro Gly Asn Pro 


Thr 


Lys Ser Ala Lys 


Ala 




20 




25 




30 


Met 


Gly Arg Asp Leu 


Arg Val His Phe 


Lys 


Asn Thr Arg Glu 


Thr 




35 




40 




45 


Ala 


Phe Ala Leu Arg 


Lys Leu Pro Leu 


Thr 


Lys Ala Lys Arg 


Tyr 




50 




55 




60 


Leu 


Glu Asp Val He 


Ala His Lys Gin 


Ala 


He Pro Phe Arg 


Arg 




65 




70 




75 


Tyr 


Cys Gly Gly Val 


Gly Arg Thr Ala 


Gin 


Ala Lys Ser Arg 


His 




80 




85 




90 


Ser 


Asn Gly Gin Gly 


Arg Trp Pro Val 


Lys 


Ser Ala Arg Phe 


He 




95 




100 




105 


Lea 


Asp Leu Leu Lys 


Asn Ala Glu Ser 


Asn 


Ala Asp Val Lys 


Gly 




110 




115 




120 


Leu 


Asp Val Asp Asn 


Leu Tyr Val Ser 


His 


He Gin Val Asn 


Gin 




'125 




130 




135 


Ala 


Gin Lys Gin Arg 


Arg Arg Thr Tyr 


Arg 


Ala His Gly Arg 


He 




140 




145 




150 


Asn 


Pro Tyr Met Ser 


Ser Pro Cys His 


He 


Glu Leu He Leu 


Ser 




155 


160 




165 


Glu 


Lys Glu Glu Pro 


Val Lys Lys Glu 


Ala 


Asp Asn He Val 


Ala 




170 


175 




180 


Ala 


Arg Lys Gin 
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<210> 389 
<211> 152 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No : LI : 257695 . 20 . orf 2 : 2000MAY01 
<400> 389 



Ala 


Pro 


Arg 


Gly 


Glu 


Gly 


Cys 


Leu Val His Ala Ser . Glu 


Pro Cys 


1 






5 


• 


10 


15 


Arg 


Pro 


Arg 


Ala 


Arg 


Cys 


Ser 


Leu Cys Arg Ser Ser Asp 


Ala Arg 










20 






. 25 


30 


Arg 


Gin 


Arg 


Gin 


Leu 


Trp 


Ala 


His Cys Lys Arg Gly Asn 


Gly Leu 










35 






40 


45 


He 


Lys 


Val 


Asn 


Gly 


Arg 


Pro 


Leu Glu Met He Glu Pro 


Arg Thr 








50 




55 


60 


Leu 


Gin 


Tyr 


Lys 


Leu 


Leu 


Glu 


Pro Val Leu, Leu Leu Gly 


Lys Glu 






65 






70 


75 


Arg 


Phe 


Ala 


Gly 


Val 


Asp 


He 


Arg, Val Arg Val Lys Gly 


Gly Gly 










80 






85 


90 


His 


Val 


Pro 


Gin 


He 


Tyr 


Gly 


Glu Ser Gin Glu Leu Gly 


Ala Trp 










95 


100 


105 


Arg 


Arg 


Trp 


Leu 


Trp 


Glu 


Gly 


Gly Leu His Ser Ala Pro 


Val Pro 










110 






115 


120 


Phe 


Asn 


Cys 


Val 


Ser 


Phe 


Ser 


Gin Leu Ser Val Ser Pro 


Ser Pro 








125 






130 


135 


Lys 


Pro 


Trp 


Trp 


Pro 


He 


Thr 


Arg Asn Val Ser Glu His 


Gly Ser 










140 






145 


150 


Phe 


Pro 

















<210> 390 

<211> 158 

<212> PRT , . 

<213> Homo sapiens 

<220> 

<221> mi sc_f eature 

<223> Incyte ID No: LI : 455771 . 1 .orf 3 : 2000MAY01 
<400> 390 



Ala 


Ser 


Cys Ser 


Arg 


Arg 


Arg 


Glu 


Ala 


Leu 


Gin Arg Thr 


Ser 


Val 


1 






5 










10 






15 


Asn 


Met 


Gly Lys 


Thr 
20 


Arg 


Gly 


Met 


Gly 


Ala 
25 


Gly Arg Lys 


Leu 


Lys 
30 


Thr 


His 


Arg Arg 


Asn 


Gin 


Arg 


Trp Ala 


Asp 


Lys Ala Tyr 


Lys 


Lys 








35 










40 






45 


Ser 


His 


Leu Gly 


Asn 
50 


Glu 


Trp 


Lys 


Lys 


Pro 
55 


Phe Ala Gly 


Ser 


Ser 
60 


His 


Ala 


Lys Gly 


He 


Val 


Leu 


Glu 


Lys 


He 


Gly He Glu 


Ala 


Lys 






65 










70 






75 


Gin 


Pro 


Asn Ser 


Ala 
80 


He 


Arg 


Lys 


Cys 


Ala 
85 


Arg Val Gin 


Leu 


Val 
90 


Lys 


Asn 


Gly Lys 


Lys 


He 


Ala 


Ala 


Phe 


Val 


Pro Asn Asp 


Gly 


Cys 






95 










100 






105 


Leu 


Asn 


Tyr He 


Glu 


Glu 


Asn 


Asp 


Glu 


Val 


Leu He Ala 


Gly 


Phe 






110 








115 






120 


Gly 


Arg 


Lys Gly 


His 


Ala 


Val 


Gly Asp 


He 


Pro Gly Val 


Arg 


Phe 








125 










130 






135 


Lys 


Val 


Val Lys 


Val 


Ser 


Gly 


Val 


ser 


Leu 


Leu Ala Leu 


Phe 


Lys 








140 








145 






150 


Glu 


Lys 


Lys Glu 


Lys 
155 


Pro 


Arg 


Ser 













<210> 391 
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<211> 94 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 1 

<223> Incyte ID No: LI : 274551 . 1 .orf 1 :2000MAY01 

<400> 391 

Pro Phe Thr Val Thr Gin Leu Gin Pro Thr Thr Leu Gin Ser Phe 

15 10 15 

Pro Cys Leu Ser Val Leu Gin Arg Leu Ser His Val Ser Gly Phe 

20 25 30 

Leu Arg Ser Ser Thr Leu lie Gly Leu lie Trp Cys Ser Ala Gin 

35 40 45 

Arg Ala Thr Pro Ser Leu Thr Tyr lie Gly Ser Ser His Leu Asp 

50 55 60 

Ala Ser Thr Gin Arg Trp Ala Trp Pro Leu Ser Lys Ala Leu Ala 

65 70 75 

Ala Leu Gin Val Pro Pro Ala Arg Pro Ser Trp Leu Arg Ala Val 

80 85 90 

Phe Ser Leu Leu 



<210> 392 

<211> 83 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> incyte ID No: LI : 035973 . 1 .orf 3 : 2000MAY01 
<400> 392 

Gly Cys Leu Ala Gly lie Arg Lys Asp Asn Lys Met Lys Gly Thr 
1 5 10 15 

Ser Pro Phe Gly Lys Cys Arg Asp Met lie His Lys Leu Cys Cys 

20 25 30 

Leu Cys Gly Ser Lys Ala Tyr His Leu Gin Lys Ser Thr Cys Gly 

35 40 45 

Lys Cys Gly Ser Pro Ala Lys Arg Lys Arg Lys Cys Asn Trp Thr 

50 55 60 

Ala Thr Ala Lys Arg Lys Tyr His Gly Asp Trp Leu Asn Glu Ala 

65 70 75 

Pro Lys His Cys lie Leu Gin lie 

80 

<210> 393 
<211> 174 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG.-978427 . 5 .orf 2 :2000FEB18 
<220> 

<221> unsure 
<222> 151 

<223> unknown or other 
<400> 393 

Trp Trp Val Cys Asp Gly Cys Leu Cys Phe Arg Thr Thr Pro Ala 
15 10 15 

Val Leu Phe Trp Gin Trp lie Asn Gin Ser Phe Asn Ala Val Val 
20 25 30 

Asn Tyr Thr Asn Arg Ser Gly Asp Ala Pro Leu Thr Val Asn Glu 
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35 


40 


45 


Leu 


Gly 


Thr Ala 


Tyr 


Val Ser Ala Thr Thr Gly Ala Val Ala 


Thr, 






50 


55 


60 


Ala 


Leu 


Gly Leu 


Asn 


Ala Leu Thr Lys His Val Ser Pro Leu 


He 






65 


70 


75 


Gly 


Arg 


Phe Val 


Pro 


Phe Ala Ala Val Ala Ala Ala Asn Cys 


He 






80 


85 


90 


Asn 


lie 


Pro Leu 


Met 


Arg Gin Arg Glu Leu Lys Val Gly lie 


Pro 








95 


100 


105 


Val 


Thr 


Asp Glu 


Asn 


Gly Asn Arg Leu Gly Glu Ser Ala Asn 


Ala 






110 


115 


120 


Ala 


Lys 


Gin Ala 


lie 


Thr Gin Val Val Val Ser Arg lie Leu 


Met 


Ala 




125 


130 


135 


Ala 


Pro Gly 


Met 


Gly lie Pro Pro Phe lie Met Asn Thr 


Leu 








140 


145 


150 


Xaa 


Lys 


Lys Ala 


Phe 


Leu Lys Arg Phe Pro Met Asp Glu Cys 


Thr 








155 


160 


165 


His 


ser 


Ser Trp 


Val 


Ser Trp lie Leu , 










170 





<210> 394 
<211> 183 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG : 247781 . 2 . orf 3 : 2000FEB18 
<400> 394 

Gin Gly Pro Arg Val Leu Leu Ala Met Pro Tyr Leu Pro Asn Ser 

1 5 10 15 

Ala Gly Tyr His His Leu Cys Gly His Arg Pro Gly Arg Leu Arg 

20 25 30 

Asp Ser Glu Glu Leu Val Ala Ser Ala Val Gin Pro Arg Leu Gly 

35 40 45 

Arg Pro Arg His Pro Arg Ala Pro Gly Leu Arg Tyr His He Gin 

50 55 60 

His Leu Arg Pro Asp Ser Gin Leu Pro Ala Gly Pro Gly Pro Asp 

65 70 75 

Pro His Ala Gly Thr Ser Leu His Arg Gly Trp Pro Pro Ala Val 

80 85 90 
His Ala Gly Ser Ala Thr Ser His Pro Val Pro Gly Gly His Ala 

95 100 105 
Gly Pro Leu Pro Gly Asp Arg Pro Gin Leu His Glu Gly Tyr Ser 

110 115 120 

Ser Cys Glu His Leu Leu Cys Gly Leu Arg Glu His Glu Ala Gly 

125 130 135 

Leu Gly Gly His Val Gin Val Arg Asp Pro Glu Pro Val Pro Pro 

140 145 150 

He Pro His Pro Pro His Leu Ser His Trp Arg Leu Met He Gin 

155 160 165 

Pro Gin Asp Pro Tyr Ser Leu Ala Thr Arg Ser Gin Tyr Pro Asp 

170 175 180 

Pro Gly Ser 

<210> 395 
<211> 399 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc__f eature 

<223> Incyte ID No: LI : 034583 . 1 . orf 1 : 2000FEB01 
<400> 395 
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Val Gin His Ala^Kr Val He Pro Glu Thr Met Ala^^ Thr Gin 
1 5 10 15 

Gin Leu Ala Asp Trp Arg Asn Thr His Ala His Gly Ser His Tyr 
20. 25 30 

Asn Pro He Met Gin Gin Pro Ala Leu Leu Thr Gly His Val Thr 
35 40 45 

Leu Pro Ala Ala Gin Pro Leu Asn Val Gly Val Ala His Val Met 

,50 55 60 

Arg Gin Gin Pro Thr Ser Thr Thr Ser Ser Arg Lys Ser Lys Gin 
'. 65 70 75 

His Gin Ser Ser Val Arg Asn Val Ser Thr Cys Glu Val Ser Ser 
80 85 90 

Ser Gin Ala He Ser Ser Pro Gin Arg Ser Lys Arg' Val Lys Glu 
95 100 105 

Asn Thr Pro Pro Arg Cys Ala Met Val His Ser Ser Pro Ala Cys , 

110 ■ 115 120 

Ser Thr Ser Val Thr Cys Gly Trp Gly Asp Val Ala Ser Ser Thr 

125 " 130 135 

Thr Arg Glu Arg Gin Arg Gin Thr He Val He Pro Asp Thr Pro 

140 145 150 1 

Ser Pro Thr Val Ser Val He Thr He Ser 1 Ser Asp Thr Asp Glu ' 

155 160 165 

Glu Glu Glu Gin Lys His Ala Pro- Thr Ser Thr Val Ser Lys Gin 

170 175 180 

Arg Lys Asn Val He Ser Cys Val Thr Val His Asp Ser Pro Tyr 

185 "190 195 

Ser Asp Ser Ser Ser Asn Thr Ser Pro Tyr Ser Val Gin Gin Arg • , 

200 205 210 

Ala Gly His Asn Asn Ala Asn Ala Phe Asp Thr Lys Gly Ser Leu 

215 220 225 

Glu Asn His Cys Thr Gly Asn Pro Arg Thr He He Val Pro Pro 

230 235 240 

Leu Lys Thr Gin Ala Ser Glu Val Leu Val Glu Cys Asp Ser Leu 

245 250 255 

Val Pro Val Asn Thr Ser His His Ser Ser Ser Tyr Lys Ser Lys 

260 265 270 

Ser Ser Ser Asn Val Thr Ser Thr Ser Gly His Ser Ser Gly Ser 
» 275 280 285 

Ser Ser Gly Ala lie Thr Tyr Arg Gin Gin Arg Pro Gly Pro His 

290 295 300 

Phe Gin Gin Gin Gin Pro Leu Asn Leu Ser Gin Ala Gin Gin His 

305 310 315 

He Thr Thr Asp Arg Thr Gly Ser His Arg Arg Gin Gin Ala Tyr 

320 325 330 

He Thr Pro Thr Met Ala Gin Ala Pro Tyr Ser Phe Pro His Asn 

335 340 345 

Ser Pro Ser His Gly Thr Val His Pro His Leu Ala Ala Ala Ala 

350 355 360 

Ala Ala Ala His Leu Pro Thr Gin Pro His Leu Tyr Thr Tyr Thr 

365 370 375 

Ala Pro Ala Ala Leu Gly Ser Thr Gly Thr Val Ala His Leu Val 

380 385 390 

Ala Ser Gin Gly Ser Ala Arg His Thr 

395 

<210> 396 
<211> 301 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 333307 . 2 . orf 1 :2000FEB01 
<220> 

<221> unsure 
<222> 286 
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<223> unknovm or other " 
<400> 396 



Thr 


Gin Phe 


Ser 


Asp 


Asn 


Asn Asn 


Glu 


Lys 


Leu 


Ser 


Pro 


Lys 


Pro 


1 






5 








10 










15 


Gly 


Thr Gly 


Glu 


Pro 


Val 


Leu Ser 


Leu 


His 


Tyr 


Ser 


Thr 


Glu 


Gly 


i 


20 








25 










30 


Thr 


Thr Thr 


Ser Thr 


He 


Lys Leu 


Asn 


Phe 


Thr 


Asp Glu Trp 


Ser 








35 






40 










45 


Ser 


lie Ala 


Ser 


Ser 


Ser Arg Gly 


He 


Gly 


Ser 


His 


Cys 


Lys 


Ser 








50 








55 










60 


Glu 


Gly Gin 


Glu 


Glu 


Ser 


Phe Val 


Pro 


Gin 


Ser 


Ser 


Val 


Gin 


Pro 






65 








70 










75 


Pro 


Glu Gly 


Asp 


Ser 


Glu 


Thr Lys 


Ala , Pro 


Glu 


Glu 


Ser 


Ser 


Glu 




80 






85 










90 


Asp 


Val Thr 


Lys 


Tyr 


Gin 


Glu Gly 


Val 


Ser 


Ala 


Glu 


Asn 


Pro 


Val 




95 








100 










105 


GlU 


Asn His 


He 


Asn 


He 


Thr Gin 


Ser 


Asp 


Lys 


Phe 


Thr 


Ala 


Lys 








110 








115, 










120 


Pro 


Leu Asp 


Ser 


Asn 


Ser 


Gly Glu Arg Asn Asp 


Leu 


Asn 


Leu 


Asp 






125 








130 










13.5 


Arg 


Ser Cys 


Gly Val 


Pro 


GlU GlU, 


Ser 


Ala 


Ser 


Ser 


Glu 


Lys 


Ala 




140 








145 




Glu 






150 


Lys 


Glu Pro 


Glu 


Thr 


Ser 


Asp Gin 


Thr 


Ser 


Thr 


Ser 


Ala 


Thr 






155 






160 










165 


Asn 


Glu Asn 


Asn 


Thr 


Asn 


Pro Glu 


Pro 


Gin 


Phe 


Gin 


Thr 


Glu 


Ala 








170 








175 










180 


Thr 


Gly Pro 


Ser 


Ala 


His 


Glu Glu 


Thr 


Ser 


Thr 


Arg Asp 


Ser 


Ala 






185 








190 










195 


Leu 


Gin Asp 


Thr 


Asp 


Asp 


Ser Asp Asp Asp 


Pro 


Val 


Leu 


He 


Pro 






200 






205 










210 


Gly 


Ala Arg 


Tyr 


Arg 


Ala Gly Pro Gly Asp Arg 


Phe 


Asn 


He 


Arg 


215 








220 










225 


Gly 


Thr Thr 


He Gly 


Asp 


Arg He 


Met 


Arg 


Arg 


Ser 


Ala 


Val 


Ala 






230 






23 5 










240 


Arg 


He Gin 


Gll^ 


Phe 


Phe Arg Arg Arg 


Lys 


Glu 


Arg 


Lys 


Glu 


Met 




245 








250 










255 


Glu 


Glu Leu 


Asp 


Thr 


Leu 


Asn lie Arg Arg 


Pro 


Leu 


Val 


Lys 


Met 






260 








265 










270 


Val 


Tyr Lys 


Gly His 


Arg Asn Ser 


Arg 


Thr 


Met 


He 


Lys 


Glu 


Ala 






275 








280 










285 


Xaa 


Phe Trp 


Gly Ala 


Asn 


Phe Val 


Met 


Ser 


Gly 


Ser Asp 


Cys 


Gly 






290 








295 










300 



His 



<210> 397 
<211> 105 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: LI : 814710 . 2 .orf 2 : 2000FEB01 
<400> 397 



GlU 


Gly 


Pro 


Glu 


Gly 


Lys 


Trp 


Lys 


Ser 


Pro 


Lys 


Phe Lys 


Met 


Pro 


1 






5 








10 








15 


GlU 


Met 


His 


Phe 


Lys 


Thr 


Pro 


Lys 


He 


Ser 


Met 


Pro Asp 


He 


Asp 










20 








25 








30 


Leu 


Asn 


Leu 


Thr 


Gly 


Pro 


Lys 


He 


Lys 


Gly 


Asp 


Val Asp Val 


Thr 










35 










40 








45 


Gly 


Pro 


Lys 


Val 


Glu 


Gly 


Asp 


Leu Lys Gly 


Pro 


Glu Val 


Asp 


Leu 






50 








55 








60 


Lys 


Gly 


Pro 


Lys 


Val 


Asp 


lie 


Asp 


Val 


Pro 


Asp 


Val Asn 


Val 


Gin 








65 








70 








75 


Gly 


Pro 


Asp 


Trp 


His 


Leu 


Lys 


Met 


Pro 


Lys 


Met 


Lys Met 


Pro 


Lys 
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50 85 90 

Phe Ser Met Pro Gly Phe Lys Gly Glu Gly Pro Gly Ser Arg Ser 
95 100 105 



<210> 398 
<211> 153 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG : 414732 . 1 .orf 1 :2000MAY19 
<400> 398 

Trp Tyr Arg Arg Leu Leu Arg Glu Ser Gly Ser Thr Met Asp He 
1 5 10 15 

Pro Val Pro Ser Ser Phe Asn Asp Val Gly Gin Asp Trp Arg Leu 

20 25 30 

Arg His Phe Val Asp Gin Met Trp Tyr Glu Arg Glu Val Thr Phe 

35 40 45 

Leu Glu Gin Trp Thr Gin Asp Leu His Thr Arg Val Val Leu Arg 

50 55 . 60 

He Val Ser Ala His Ser Tyr Ala He Val Trp Val Asn Gly Val 

65 70 75 

Asp Ala Leu Glu His Glu Gly Ser Thr Ser Pro Leu Thr Pro Thr 

80 85 90 

Ser Val Ala Cys Ser Arg Trp Gly Pro Cys Pro Pro Ala Ser Ala 

95 100 105 

Ser Leu Ser Pro Ser Ala Thr Cys Ser Ser Pro Pro Pro Cys His 
110 115 , 120 

Gin Gly Ala Ser Ser Thr Trp Pro Thr Pro Pro Arg Gly Tyr His 
125 130 135 

Pro Ala Ser Thr Ala Asp Thr His Leu Pro Val Pro Pro Arg Gly 
140 145 150 

Ala Leu His 



<210> 399 
<211> 161 
<212> PRT 
<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: LG:413910 . 6 .orf 1 :2000MAY19 
<400> 399 



Ser 


Met Leu 


Ala 


Ser 


Gin Gly Val Leu 


Leu 


His 


Pro' Tyr Gly 


Val 


1 






5 


10 






15 


Pro 


Met He 


Val 


Pro 


Ala Ala Pro Tyr 


Leu 


Pro 


Gly Leu He 


Gin 








20 




25 






30 


Gly 


Asn Gin 


Glu 


Ala 


Ala Ala Ala Pro 


Asp 


Thr 


Met Ala Gin 


Pro 








35 




40 






45 


Tyr 


Ala Ser 


Ala 


Gin 


Phe Ala Pro Pro 


Gin 


Asn 


Gly He Pro 


Ala 








50 




55 




60 


Glu 


Tyr Thr Ala Pro 


His Pro His Pro 


Ala 


Pro 


Glu Tyr Thr 


Gly 








65 




70 






75 


Gin 


Thr Thr 


Val 


Pro 


Glu His Thr Leu 


Asn 


Leu 


Tyr Pro Pro 


Ala 








80 




85 




90 


Gin 


Thr His 


Ser 


Glu 


Gin Ser Pro Ala 


Val 


Phe 


Leu Phe Val 


He 








95 




100 






105 


Thr 


Arg Ala Val Ala 


Leu Phe Thr Ser 


He 


Leu 


Arg Pro Ser 


Thr 








110 




115 




120 


Thr 


Val Pro 


Cys 


Asn 


Phe Ser Leu Ala 


Leu 


Ser 


Ala Ser Ala 


Leu 






125 




130 






135 


Phe 


Ser Lys 


Val 


Thr 


Lys Pro Asn Pro 


Phe 


Glu 


Pro Arg Ser 


Leu 
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140 1 145 ' 150 

Lys lie He Ser Thr Ser Lys He Leu Pro Asn 
155 160 

<210> 400 
<211> 153 * 
<212> PRT ' 
<213> Homo sapiens 

<220> 

<221> misc_ f eature 

<223> Incyte ID No: LI : 414732 . 2 .orf 1 : 2000MAY01 
<400> 400 



Trp 


Tyr 


Arg Arg 


Leu 


Leu Arg Glu 


Ser 


Gly 


Ser 


Thr Met 


Asp He 


1 




5 






10 






15 


Pro 


Val 


Pro Ser 


Ser 


Phe Asn Asp Val 


Gly Gin 


Asp Trp Arg Leu 








20 






25 






30 


Arg 


His 


Phe Val 


Asp 


Gin Met Trp 


Tyr 


Glu 


Arg 


Glu Val 


Thr Phe 






35 






40 






45 


Leu 


Glu 


Gin Trp 


Thr 


Gin Asp Leu His 


Thr Arg 


Val Val 


Leu Arg 






50 






55 






60 


He 


Val 


Ser Ala 


His 


Ser Tyr Ala 


lie 


Val 


Trp 


Val Asn Gly Val 








65 




70 




Leu Thr 


75 


Asp 


Ala 


Leu Glu 


His 


Glu Gly Ser 


Thr 


Ser 


Pro 


Pro Thr 






80 




85 






90 


Ser 


Val 


Ala Cys 


Ser 


Arg Trp Gly 


Pro 


Cys 


Pro 


Pro Ala 


Ser Ala 






95 




100 






105 


Ser 


Leu 


Ser Pro 


Ser 


Ala Thr Cys 


Ser 


Ser 


Pro 


Pro Pro 


Cys His 








110 




115 






120 


Gin 


Gly 


Ala Ser 


Ser 


Thr Trp Pro 


Thr 


Pro 


Pro 


Arg Gly 


Tyr His 






125 




130 






135 


Pro 


Ala 


Ser Thr 


Ala 


Asp Thr His 


Leu 


Pro 


Val 


Pro Pro 


Arg Gly 








140 




145 






150 


Ala 


Leu 


His 

















<210> 401 
<211> 135 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 900264 . 2 . orf 3 : 2000MAY01 
<400> 401 



Arg 


Gin 


He 


Val 


Ser 


Val Leu 


Ser 


Cys 


Arg 


Phe 


Asp Ser Asn 


Gin 


1 








5 






10 






15 


Leu 


Pro 


Ala 


Asn 


Ala 


Pro He 


Glu 


Asp 


Arg 


Arg 


Ser Ala Ala 


Thr 










20 








25 






30 


Cys 


Leu 


Gin 


Thr 


Arg 


Gly Leu 


Leu 


Leu 


Gly 


Val 


Phe Asp Gly 


His 








35 






40 






45 


Ala 


Gly 


Cys 


Ala 


Cys 


Ser Gin 


Ala 


Val 


Ser 


Glu 


Arg Leu Phe 


Tyr 








50 








55 






60 


Tyr 


He 


Ala 


Val 


Ser 


Leu Leu 


Pro 


His 


Glu 


Thr 


Leu Leu Glu 


He 








65 








70 






75 


Glu 


Asn 


Ala 


Val 


Glu 


Ser Gly 


Arg 


Ala 


Leu 


Leu 


Pro He Leu 


Gin 










80 




85 






90 


Trp 


His 


Lys 


His 


Pro 


Asn Asp 


Tyr 


Phe 


Ser 


Lys 


Glu Ala Ser 


Lys 






95 








100 






105 


Leu 


Tyr 


Phe 


Asn 


Ser 


Leu Arg 


Thr 


Tyr 


Trp 


Gin 


Gly Ala Tyr Arg 








110 






115 






120 


Pro 


Gin 


His 


Trp 


Val 


Ser Arg 


Leu 


He 


Leu 


Met 


Leu Arg Arg 


Leu 








125 






130 






135 
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<210> 402 
<211> 129 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 335593 . 1 .orf 3 :2000MAY01 
<220> 

<221> unsure 

<222> 28, 39-40 

<223> unknown or other 

<400> 402 



Arg 


Gly 


Ala Gly 


Thr 


Ala 


Ala 


Leu Pro Ser 


Arg 


Leu 


Leu 


Pro 


Ser 


1 






5 






10 










15 


Ser 


Ala 


Ala Arg 


Ser 


Ser 


Leu 


Gly He His 


Leu 


Leu 


Xaa 


Leu 


Leu 








20 






25 










30 


Leu 


Leu 


lie His 


Ser 


Phe 


Pro 


His Xaa Xaa 


Leu Leu Leu Gly 


Phe 








35 






40 










45 


Ser 


Pro 


Arg Pro 


Ala 


Ser 


Pro 


Arg Ala Leu 


Pro 


Leu 


Pro 


Leu 


Pro 








50 






55 










60 


Val 


Leu 


Pro Gly 


Pro 


Leu Leu 


Pro Leu He 


His 


Ser 


Pro 


Leu 


Ser 








65 






70 










75 


Leu 


Leu 


His Ser 


Leu 


Pro 


Leu 


Ser Pro Phe 


Phe 


Phe 


Phe 


Phe 


His 








80 






, 85 










90 


Pro 


Pro 


Ser Leu 


Thr 


Pro 


Pro 


Pro Phe' Pro 


Cys 


Leu 


Leu 


Ser 


Asp 








95 






100 








105 


Thr 


Ala 


Leu Gin 


Leu 


Leu 


Leu 


Ser Pro Ala 


Pro 


Ser 


Pro 


Val 


Arg 








110 






115 










120 


Thr 


Asn 


Gin Gin 


His 


Cys 


Phe 


Phe Ser 













125 

<210> 403 
<211> 299 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 1189543 . 1 .orf 1 : 2000MAY01 
<400> 403 



Glu 


Phe Arg Gin Asn 


Lys Arg Glu Asn 


Leu 


Leu 


Pro Val Ala 


Ala 


1 


5 


10 






15 


Ala 


Gly Thr Ala Asn 


Met Met Ala Ala 


Ala 


Pro 


He Gin Gin 


Asn 




20 




25 






30 


Gly 


Thr His Thr Gly 


Val Pro He Asp 


Leu 


Asp 


Pro' Pro Asp 


Ser 




35 




40 




45 


Arg 


Lys Arg Pro Leu 


Glu Ala Pro Pro 


Glu 


Ala 


Gly Ser Thr 


Lys 




50 




55 






60 


Arg 


Thr Asn Thr Gly 


Glu Asp Gly Gin 


Tyr 


Phe 


Leu Lys Val 


Leu 




65 




70 






75 


He 


Pro Ser Tyr Ala 


Ala Gly Ser He 


He 


Gly 


Lys Gly Gly 


Gin 




80 




85 






90 


Thr 


He Val Gin Leu 


Gin Lys Glu Thr 


Gly 


Ala 


Thr He Lys 


Leu 




95 


100 




105 


Ser 


Lys Leu Ser Lys 


Ser Lys Asp Phe 


Tyr 


Pro 


Gly Thr Thr 


Glu 




110 




115 






120 


Arg 


Val Cys Leu He 


Gin Gly Thr Val 


Glu 


Ala 


Leu Asn Ala 


Val 




125 


130 






135 


His 


Gly Phe He Ala 


Glu Lys He Arg 


Glu 


Met 


Pro Gin Asn 


Val 




140 




145 






150 


Ala 


Lys Thr Glu Pro 


Val Ser He Leu 


Gin 


Pro 


Gin Thr Thr 


Val 




155 




16.0 






165 


Asn 


Pro Asp Arg He 


Lys Gin Thr Leu 


Pro 


Ser 


Ser Pro Thr 


Thr 
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Thr Lys ser Ser 
Gin Val Lys lie 
Gly Lys Gly Gly 
Ala Trp Val Gin 
Glu Arg Val Val 
Ala Val Glu Leu 
Gly Ser Cys Leu 
Gly Lys Phe Gin 



<210> 404 
<211> 142 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG : 455450 . 1 . orfl : 2000FEB18 
<400> 404 



Gly 


Gly 


Gly 


He 


Leu 


Arg 


Arg 


Ser Gly 


Ser 


Ser 


Ser Ser Ser 


Ser 


1 








5 










10 






15 


Ser 


Ser 


Ser 


Ser 


GlU 


Asp 


Asp 


Gly Met 


Gly 


Gly 


Arg Arg Lys 


Lys 










20 






25 






30 


Gly 


Leu 


Lys 


Glu 


Lys 
35 


He 


Lys 


Glu 


Lys 


Met 
40 


Pro 


Gly Gly His 


Arg 
45 


Glu 


Gly 


Gin 


Gly 


Gin 


Ala 


Thr 


Ala 


Thr 


Gly 


Ala 


Tyr Gly Gly 


Thr 






50 










55 






60 


Gly 


Tyr 


Val 


Ala 


Gly 


Pro 


Thr 


Thr 


Gly 


Gly 


Pro 


His Glu Lys 


Lys 








65 










70 






75 


Gly 


Val 


Val 


Glu 


Lys 
80 


He 


Lys 


Glu 


Lys 


He 
85 


Pro 


Gly Gly His 


Lys 
90 


Asp 


Tyr 


Asp 


Gin 


His 


Gin 


His 


Thr 


Thr 


Ala 


Ala 


Thr Gly Gly 


Gly 




95 










100 






105 


Gly 


Gly 


Tyr 


Gly 


Gly 
110 


Thr 


Thr 


Asp 


Thr 


Thr 
115 


Tyr 


Gly Thr Thr 


Thr 
120 


Thr 


Glu 


Gly 


Thr 


His 


GlU 


Lys 


Lys 


Gly 


Phe 


Met 


Asp Lys He 


Lys 








125 










13 0 






135 


Glu 


Lys 


Leu 


Pro 


Gly 
140 


Gin. 


His 















<210> 405 
<211> 168 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG : 1040978 . 1 . orf 2 : 2000FEB18 
<400> 405 



Ser Tyr Leu Arg Ser 


Arg Gly Gin 


Pro 


Pro 


Pro 


Arg Arg Ser 


His 


1 5 






10 






15 


Ala Leu Arg Ala Arg 


Arg Leu Ser 


Ser 


Val 


Ser 


Ala Ser Leu 


Pro 


20 




25 






30 


Leu Pro Ser Arg Leu 


Thr His Met 


Ala 


Ser 


He 


Ala Gly Ser 


Ser 


35 






40 






45 


Ala Leu Ser Phe Ala 


Arg Pro Val 


Lys 


Ala 


He 


Asn Thr Asn 


Ser 


50 


55 






60 
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170 175 T80 

Pro Ser Asp Pro Met Thr Thr Ser Arg Ala Asn, 
185 190 195 

He Val Pro Asn Ser Thr Ala Gly Leu He He 
200 205 210 

Ala Thr Val Lys Ala Val Met Glu Gin Ser Gly 
215 220 225 

Leu Ser Gin Lys Pro Asp Gly He Asn Leu Gin 
230 235 240 

Thr Val Ser Gly Glu Pro Glu Gin Asn Arg Lys 
245 250 255 

He He Gin Lys He Gin Glu Asp Pro Gin Ser 
260 265 270 

Asn He Ser Tyr Ala Asn Val Thr Gly Pro Val 
275 280 285 

Ser Asn Arg He Ser Leu Cys Lys His . Cys 
290 295 
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Leu 


Ala 


Phe 


Sej^¥ro 


Ala 


Arg Lys Gly 


Asn 


Thr 


Phe^fu Arg Leu 








65 




70 




75 


Gin 


Pro 


Met 


Pro Met 


Arg 


Ser Val Ser 


Cys 


Ala 


Ala Lys Lys Asp 








80, 






85 




90 


Thr 


Thr 


Asp 


Lys Val 


Cys 


Glu He Val 


Lys 


Lys 


Gin Leu Ala Leu 








95 






100 




105 


Pro 


Asp 


His 


Thr Glu 


Val 


Cys Gly Glu 


Ser 


Lys 


Phe Ser Glu Leu 






110 






115 




120 


Gly 


Ala 


Asp 


Ser Leu 


Asp 


Thr Val Glu' 


He 


Val 


Met Ser Leu Glu 








125 






130 




135 


Glu 


His 


Phe 


Asp He 


Ser 


Val Glu Glu 


Ser 


Ser 


Ala Gin Thr He 








140 






145 




150 


Ala 


Thr 


Val 


Glu Asp 


Ala 


Ala Asp Leu 


He 


Asp 


Lys 1 Leu Val Ala 








155 






160 




165 


Gly 


Lys 


Ala 















<210> 406 
<211> 117 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID Nb: LG : 446649 . 1 .orf 2 :2000FEB18 



<400> 406 












Leu 


Lys 


Ala 


Arg Glu 


Gin Ala Gin Lys 


Arg 


Glu 


Ala He Gin Val 


1 






5 




10 




15 


Thr 


Ser 


Pro 


Val Cys 


Leu Arg Leu He 


Leu 


Arg 


Lys Ala Gly Glu 








20 




25 




30 


Glu 


Val 


Lys 


Arg Leu 


Lys Thr Gin Pro 


Thr 


Asp 


Glu Glu Met Leu 








35 




40 




45 


Phe 


He 


Tyr 


Ser His 


Phe Lys Gin Ala 


Thr 


Val 


Gly Asp Val Asn 








50 




55 




60 


Thr 


Asp 


Arg 


Pro Gly 


Leu Leu Asp Leu 


Lys 


Gly 


Lys Ala Lys Trp 








65 




70 




75 


Asp 


Ser 


Trp 


Asn Lys 


Leu Lys Gly Thr 


Ser 


Lys 


Glu Asn Ala Met 








80 


85 




90 


Lys 


Thr 


Tyr 


Val Glu 


Lys Val Glu* Glu 


Leu 


Lys 


Lys Lys Tyr Gly 








95 




100 




105 


He 


Leu 


Thr 


Thr Arg 


Phe Gly Gly Gin 


Pro 


His 


Val 








110 


115 







<210> 407 
<211> 804 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 132147 . 3 .orf 3 :2000FEB18 
<400> 407 



Ala 


Ser Ser Ala Ala 


Gin Asp Asp Tyr 


Ala 


Arg 


His Val Arg Asp 


1 


5 




10 




15 


Pro 


Pro Ala Val Ala 


Val Ala Ala Pro 


Lys 


Ala 


Arg Ala Ser Ala 




20 




25 




30 


Pro 


Asp Ala Val Val 


Thr Pro Pro Arg 


Leu 


Ala 


Val Ala Pro Val 




35 


40 




45 


Pro 


Val Leu Pro Ala 


Ala Ser Ala Ala 


Gly 


Asp 


Met Ser Asn Pro 




50 




55 




60 


Gly 


Ser Arg Arg Asn 


Gly Pro Val Lys 


Leu 


His 


Leu Thr Val Leu 




65 


70 




75 


Cys 


Ala Lys Asn Leu 


Val Lys Lys Asp 


Phe 


Phe 


Arg Leu Pro Asp 




80 




85 




90 


Pro 


Phe Ala Lys Val 


Val Val Asp Gly 


Ser 


Gly 


Gin Cys His Ser 
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WO 01/62927 

95 



Thr 


Asp 


Thr 


Val 


Lys 


Asn 


Thr 


Leu 










110 








Tyx 


Asp 


Leu Tyr 


He 


Gly 


Lys 


Ser 










125 


• 






Trp 


Asn 


His 


Lys 


Lys 


He 


His 


Lys 










140 








Gly 


Cys 


Val Arg Leu 


Leu 


Ser 


Asn 










155 








Thr 


Gly 


Tyr 


Gin 


Arg 


Leu 


Asp 


Leu 










170 








Asn 


Asp 


Thr 


Val 


Arg 


Gly 


Gin 


He 










185 








Asp 


Arg 


He Gly Thr 


Gly 


Gly 


Gin 










200 








Phe 


Asp 


Asn 


Asp 


Leu 


Pro 


Asp 


Gly 










215 








Ser 


Gly 


Arg 


He 


Gin 


Tyr 


Leu 


Asn 










230 








Trp 


Glu 


Arg 


Pro 


Thr 


Arg 


Pro 


Ala 










245 








Arg 


Pro 


Leu 


Ser 


Cys 


Phe 


Val 


Asp, 










200 








Thr 


Asn 


Gly Ala 


Thr 


Cys 


Gly 


Gin 










275 








Glu 


Arg 


Arg Val 


Arg 


Ser 


Gin 


Arg 










290 








Thr 


His 


Leu 


His 


Thr 


Pro 


Pro 


Asp 










305 






Arg 


Thr 


Thr 


Gin 


Gin 


Gly 


Gin 


Val 








320 






Gly 


Val 


Ser 


Thr 


Trp 


His 


Asp 


Pro 










335 








Asn 


He 


Asn Cys 


Glu 


Glu 


Leu 


Gly 










350 








lie 


Arg 


Asn 


Thr; 


Ala 


Thr 


Gly 


Arg 










365 








Asn 


Arg 


Thr 


Thr 


Gin 


Phe 


Thr 


Asp 










380 








His 


Leu 


Val 


Leu 


Asn 


Arg 


Gin 


Asn 










395 






Gin 


Gin 


Val 


Val 


Ser 


Leu 


Cys 


Pro 










410 






Val 


Pro 


Arg 


Tyx 


Lys 


Arg 


Asp 


Leu 










425 








Arg 


Gin 


Glu 


Leu 


Ser 


Gin 


Gin 


Gin 










440 








He 


Glu 


Val 


Ser 


Arg 


Glu 


Glu 


He 










455 








Val 


Met 


Lys 


Met 


Arg 


Pro 


Lys 


Asp 










470 








Lys 


Phe 


Arg 


Gly Glu 


Glu 


Gly 


Leu 










485 








Glu 


Trp 


Leu 


Tyr 


Leu 


Leu 


Ser 


His 










500 








Gly 


Leu 


Phe 


Gin 


Tyr 


Ser 


Arg 


Asp 










515 








Asn 


Pro 


Asp 


Ser 


Ala 


Val 


Asn 


Pro 










530 








Phe 


Val 


Gly Arg 


He 


Met 


Gly 


Met 










545 








He 


Asp 


Gly 


Gly 


Phe 


Thr 


Leu 


Pro 










560 








Lys 


Ser 


He 


Thr 


Leu 


Asp 


Asp 


Met 










575 








His 


Asn 


Ser 


Leu 


Val 


Trp 


He 


Leu 



590 
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100 




• 105 


Asp 


Pro 


Lys 


Trp Asn Gin His 


115 




120 


Asp 

* 


Ser 


Val 


Thr He Ser Val 


130 




135 


Lys 

• 


Gin 


Gly 


Ala Gly Phe Leu 


145 




150 


Ala 


He 


Asn 


Arg Leu Lys Asp 




160 




165 


Cys 


Lys 


Leu 


Gly Pro Asn Asp 




175 




180 


Val 


Val 


Ser 


Leu Gin Ser Arg 




190 




195 


Val, 


Val 


Asp 


Cys Ser Arg Leu 




205 




210 


Trp 


Glu 


Glu 


Arg Arg Thr Ala 


220 




225 


His 


He 


Thr 


Arg Thr Thr Gin 




235, 




240 


Ser 


Glu 


Tyr 


Ser Ser Pro Gly 




250 




255 


Glu 


Asn 


Thr 


Pro He Ser Gly 




265 




270 


Ser 


Ser 


Asp 


Pro Arg Leu Ala 




280 




285 


His 


Arg 


Asn 


Tyr Met Ser Arg 




295 




300 


Leu 


Pro 


Glu 


Gly Tyr Glu Gin 




310 




315 


Tyr 


Phe 


Leu 


His Thr Gin Thr 


325 




330 


Arg Val 


Pro 


Arg Asp Leu Ser 




340 




345 


Pro 


Leu 


Pro 


Pro Gly Trp Glu 




355 




360 


Val 


Tyr 


Phe 


Val Asp His Asn 




370 




375 


Pro 


Arg 


Leu 


Ser Ala Asn Leu 




385 




390 


Gin 


Leu 


Lys 


Asp Gin Gin Gin 




400 




405 


Asp 


Asp 


Thr 


Glu Cys Leu Thr 




415 




420 


Val 


Gin 


Lys 


Leu Lys He Leu 




43 0 




435 


Pro 


Gin 


Ala 


Gly His cys Arg 




445 




450 


Phe 


Glu 


Glu 


Ser Tyr Arg Gin 




460 




465 


Leu 


Trp 


Lys 


Arg Leu Met He 




475 




480 


Asp 


Tyr 


Gly 


Gly Val Ala Arg 




490 




495 


Glu 


Met 


Leu 


Asn Pro Tyr Tyr 




505 




510 


Asp 


He 


Tyr 


Thr Leu Gin He 




520 




525 


Glu 


His 


Leu 


Ser Tyr Phe His 




535 




540 


Ala 


Val 


Phe 


His Gly His Tyr 




550 




555 


Phe 


Tyx 


Lys 


Gin Leu Leu Gly 




565 




570 


Glu 


Leu 


Val 


Asp Pro Asp Leu 




580 




585 


Glu 


Asn 


Asp 


He Thr Gly Val 




595 


600 
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Leu 


Asp 


His Thr 


Phe 


Cys 


Val 


Glu His 


Asn 


Ala Tyr 


Giy Glu He 








605 






610 




615 


He 


Gin 


His Glu 


Leu 


Lys 


Pro 


Asn Gly 


Lys 


Ser He 


Pro Val Asn 








620 




625 


•i 


630 


Glu 


GlU 


Asn Lys 


Lys 


Glu 


Tyr 


Val Arg 


Leu 


Tyr val 


Asn Trp Arg 








635 








640 




645 


Phe 


Leu 


Arg Gly 


He 


Glu 


Ala 


Gin Phe 


Leu 


Ala Leu 


Gin Lys Gly 






650 






* 


655 




660 


Phe 


Asn 


Glu Val 


He 


Pro 


Gin 


His Leu 


Leu 


Lys Thr 


Phe Asp Glu 








665 








670 




675 


Lys 


GlU 


Leu Glu 


Leu 


He 


He 


Cys Gly 


Leu 


Gly Lys 


He Asp Val 








680 








685 




690 


Asn 


Asp 


Trp Lys 


Val 


Asn 


Thr 


Arg Leu 


Lys 


His Cys 


Thr Pro Asp 








695 








700 




705 


Ser 


Asn 


He Val 


Lys 


Trp 


Phe 


Trp Lys 


Ala 


Val Glu 


Phe Phe Asp 








710 








715 




720 


Glu 


GlU 


Arg Arg 


Ala 


Arg 


Leu 


Leu Gin 


Phe 


Val Thr 


Gly Ser Ser 








725 








730 




735 


Arg 


Val 


Pro Leu 


Gin 


Gly 


Phe 


Lys Ala 


Leu 


Gin Gly 


Ala Ala Gly 








740 








745 




750 


Pro 


Arg 


Leu Phe 


Thr 


He 


His 


Gin He 


Asp 


Ala Cys 


Thr Asn Asn 








755 








760 




765 


Leu 


Pro 


Lys Ala 


His 


Thr 


Cys 


Phe Asn 


Arg 


He Asp 


He Pro Pro 








770 






775 




780 


Tyr 


Glu 


Ser Tyr 


Glu 


Lys 


Leu 


Tyr Glu 


Lys 


Leu Leu 


Thr Ala He 








785 








790 




795 


Glu 


GlU 


Thr Cys 


Gly 


Phe 


Ala 


Val Glu 









<210> 408 
<211> 220 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI: 03 6034. l.orf 1:2000FEB01 
<400> 408 



Thr 


He 


His Leu Lys 


Thr 


Leu 


He He 


Val 


Trp 


Lys 


Arg Tyr Ser 


1 




5 








10 






15 


Asp 


Phe 


Lys Lys Leu 


His 


Lys 


Glu Leu 


Trp 


Gin 


He 


His Lys Asn 






20 








25 






30 


Leu 


Phe 


Arg His Ser 


GlU 


Leu 


Phe Pro 


Pro 


Phe 


Ala 


Lys Gly He 






35 








40 






45 


Val 


Phe 


Gly Arg Phe 


Asp 


Glu 


Thr Val 


He 


Glu 


Glu 


Arg Arg Gin 






50 








55 






60 


Tyr 


Ala 


Glu Asp Leu 


Leu 


Gin 


Phe Ser 


Ala 


Asn 


He 


Pro Ala Leu 






65 








70 






75 


Tyr 


Asn 


Ser Lys Gin 


Leu 


Glu 


Asp Phe 


Phe 


Lys 


Gly 


Gly He He 






80 








85 






90 


Asn 


Asp 


Ser Ser Glu 


Leu 


He 


Gly Pro 


Ala 


GlU 


Ala 


His Ser Asp 






95 






100 






105 


Ser 


Leu 


He Asp Thr 


Phe 


Pro 


Glu Cys 


Ser 


Thr 


Glu 


Gly Phe Ser 






110 






115 






120 


Ser 


Asp 


Ser Asp Leu 


Val 


Ser 


Leu Thr 


Val 


Asp Val 


Asp Ser Leu 






125 








130 






135 


Ala 


Glu 


Leu Asp Asp 


Gly 


Met 


Ala Ser 


Asn 


Gin Asn 


Ser Pro He 






140 








145 






150 


Arg 


Thr 


Phe Gly Leu 


Asn 


Leu 


Ser Ser 


Asp 


Ser 


Ser 


Ala Leu Gly 






155 








160 






165 


Ala 


Val 


Ala Ser Asp 


Ser 


Glu 


Gin Ser 


Lys 


Thr 


Glu 


Glu Glu Arg 






170 








175 






180 


Glu 


Ser 


Arg Ser Leu 


Phe 


Pro 


Gly Ser 


Leu 


Lys 


Pro 


Lys Leu Gly 






185 






190 






195 


Lys 


Arg 


Asp Tyr Leu 


Glu 


Lys 


Ala Gly 


Glu 


Leu 


He 


Lys Leu Ala 



800 



200 



205 



210 
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Leu Lys Lys Glu Glu Glu Asp Asp Tyr Glu 
215 220 

<210> 409 
<211> 168 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 162161 . 1 . orf 2 : 2000MAY19 
<4Q0> 409 

Gin Asp His Gin Phe Ala Pro Gin Ser Thr Met Ser Arg Ser Arg 
1 5 .10 15 

Gin Pro Pro Leu Val Thr Gly lie Ser Pro Asn Glu Gly. lie Pro 
20 25 30 

Trp Thr Lys Val Thr lie Arg Gly Glu Asn Leu Gly Thr Gly Pro 
35 40 45 

Thr Asp Leu lie Gly Leu Thr He Cys Gly His Asn Cys Leu Leu 
50 55 60 

Thr Ala Glu Trp Met Ser Ala Ser Lys He Val Cys Arg Val Gly 
65 70 75 

Gin Ala Lys Asn Asp Lys Gly Asp He He Val Thr Thr Lys Ser 
80 85 90 

Gly Gly Arg Gly Thr Ser Thr Val Ser Phe Lys Leu Leu Lys Pro 
95 100 105 

Glu Lys He Gly He Leu Asp Gin Set Ala Val Trp Val Asp Glu 
110 115 120 

Met Asn Tyr Tyr Asp Met Arg Thr Asp Arg Asn Lys Gly He Pro 
125 130 135 

Pro Leu Ser Leu Arg Pro Ala Asn Pro Leu Gly Met Glu He Glu 
140 145 150 

Pro Ser Thr Phe Ser Gin Lys Asp Leu Glu Met Leu Phe His Gly 
155 160 165 

Met Ser Ala 



<210> 410 
<211> 108 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: LG : 407214 . 10 . or f 2 : 2000MAY19 
<400> 410 



Lys 


Ser 


Arg Met 


Asp 


Leu Asp Val Val 


Asn 


Met Phe 


Val 


He Ala 


1 






5 






10 






15 


Gly 


Gly 


Thr Leu 


Ala 


lie 


Pro He Leu 


Ala 


Phe Val 


Ala 


Ser Phe 






20 






25 






30 


Leu 


Leu 


Trp Pro 


Ser 


Ala 


Leu He Arg 


He 


Tyr Tyr 


Trp 


Tyr Trp 








35 






40 






45 


Arg 


Arg 


Thr Leu Gly 


Met 


Gin Val Arg 


Tyr 


Val His 


His 


Glu Asp 








50 






55 






60 


Tyr 


Gin 


Phe Cys 


Tyr 


Ser 


Phe Arg Gly 


Arg 


Pro Gly His 


Lys Pro 








65 






70 






75 


Ser 


He 


Leu Met 


Leu 


His 


Gly Phe Ser 


Gly 


His Lys 


Asp 


Met Trp 








80 




85 






90 


Leu 


Ser 


Val Val 


Lys 


Val 


Pro Ser Lys 


Glu 


Pro Ala 


Leu 


Gly Leu 








.95 






100 






105 


Arg 


Gly 


His 

















<210> 411 
<211> 314 
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<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 204626,. 1 . orf 1 :2000MAY19 
<400> 411 ' 



Gl v 


Vai 
vai 


Ala 


Arg Lys 


Lys Lys He Vai" He Lys 


f2ln fil n Ptt> Ta/q 
vjIU V9J.U c J. \J o 


1 

•L 








5 


10 




15 




Aiy 


Lljfo 


Gly Lys 


Met Lys Asp Arg Leu Gin 


fH ii T.on TiVTe CI 1 ti 
17JLU Ucu. jjyo . VarXXl 










20 


25 






Arg 


ill XT 


uys 


Glu 


He 


Glu Leu Ser Arg Asp Ser 


Hie* Vai Cor fVir* 
n,ib vet j. oci xiiJ. 










35 


40 




AC 


Thr 


v?iu 


inr 


Glu 


Glu 


Gin Gly Vai Phe Leu 


Gin 


lain Ala vai lie 










50 


• 55 




ou 


lyr 


r»i ii 

ValU 


Arg 


Glu 


Pro 


Vai Ala Glu Arg His Leu 


tt-4 o fin TI a Pin 
nlS blu lie <jjJ.il 










65 


70 




/ D 


Lys 


Leu 


v?in 


Glu 


Ser 


He Asn Asn Leu Ala Asp 


Asn vai tain iiys 








80 


85 




on 


pne 


Gly 


Gin 


Gin 


Gin 


Lys Ser Leu Vai Ala- 


Ser 


Met Arg Arg pne 








95 


100 




1UD 


Ser 


Lieu 


Leu 


Lys 


Arg. 


Glu ser Thr- He Thr 


Lys 


uiu lie iiys lie 










110 


115 




ion 

izu 


Gin 


Ala 


Glu 


Tyr 


He 


Asn Arg Ser Leu Asn 


Asp 


Leu Vai Lys Glu 










125 


130 




1J3 


vai 


Lys 


Lys 


Ser 


Glu 


Vai Glu Asn Gly Pro 


Ser 


Ser vai vai Tnr 








140 


145 




150 


Arg 


He 


Leu 


Lys 


Ser 


Gin His Ala Ala Met 


Phe 


Arg His Pne Gin 








155 


160 




lob 


Gin 


He 


Met 


Phe 


He 


Tyr Asn Asp Thr He 


Ala 


Ala Lys Gin Glu 










170 


175 




loU 


Lys 


Cys 


Lys 


Thr 


Phe 


lie Leu Arg Gin Leu 


Glu 


Vai Ala Gly Lys 










185 


190 




1 QC 

1273 


pi,, 


wet- 


Ser 


Glu 


Glu 


Asp Vai Asn Asp Met 


Leu 


his tain «iy iiys 










200 


205 




210 


Trp 


Glu 


Vai 


Phe 


Asn 


Glu Ser Leu Leu Thr 


Glu 


He Asn He Thr 










?15 


220 




225 


Lys 


Ala 


Gin 


Leu 


Ser 


Glu He Glu Gin Arg 


His 


Lys Glu Leu Vai 










230 


235 




240 


Asn 


Leu 


Glu 


Asn 


Gin 


He Lys Asp Leu Arg 


Asp 


Leu Phe He Gin 










245 


250 


255 


He 


Ser 


Leu 


Leu 


Vai 


Glu Glu Gin Gly Glu 


Ser 


He Asn Asn He 










260 


265 




270 


Glu 


Met 


Thr 


Vai 


Asn 


Ser Thr Lys Glu Tyr 


Vai 


Asn Asn Thr Lys 










275 


280 




285 


Glu 


Lys 


Phe 


Gly Leu 


Ala Vai Lys Tyr Lys 


Lys 


Arg Asn Pro Cys 










290 


295 




300 


Arg 


Vai 


Leu 


cys 


Cys 


Trp Cys Cys Pro Cys 


cys 


Ser Ser Lys 










305 


310 







<210> 412 
<211> 143 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: LI : 007401 . 1 .orf 2 :2000MAY01 
<400> 412 

Met Ala Ser Glu Ser Asp Thr Glu Glu Phe Tyr Asp Ala Pro Glu 
1 5 10 15 

Asp Vai His Leu Gly Gly Gly Tyr Pro Vai Gly Ser Pro Gly Lys 

20 25 30 

Vai Gly Leu Ser Thr Phe Lys Glu Thr Glu Asn Thr Ala Tyr Lys 

35 40 45 
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Val Gly Asn 


Glu Ser 


Pro 


Val Gin Glu 


Leu 


Lys Gin Asp 


Val 


Ser 


50 






55 


Ser Gin 






60 


Asn Lys lie 


He Glu 


Ser 


He He Glu 


GlU 


Lys 


Val 


Leu 


65 




i 


• 70 








75 


Gin Leu Glu 


Asp Asp 


Ser 


Leu Asp Ser 


Thr Gly Lys 


Glu 


Leu 


Ser 




80 




• 


85 








90 


Asp Gin Ala 


Thr Ala 


Ser 


Pro lie Val 


Ala 


Arg Thr 


Asp 


Leu 


Ser 


95 






100 








105 


Asn lie Pro Gly Leu 


Leu 


Ala He Asp 


Gin 


Val Leu 


Pro 


Glu 


Glu 




110 




115 








120 


Ser Gin Lys 


Ala Glu 


Ser 


Gin Asn Thr 


Phe 


Glu Glu 


Thr 


GlU 


Leu 


125 






130 


i 






135 


Glu Phe Lys 


Lys Met 


Leu 


Ser Phe 












140 
















<210> 413 


















<211> 122 


















<212> PRT 


















<213> Homo sapiens 
















<220> 


















<221> mis cofeature 
















<223> Incyte ID No: 


LI : 47 6342.1. or f 1 :200 0MAY01 








<400> 413 


















Gly Arg Thr 


Arg Val 


Ser Gly Pro Val Arg Asn Asn Thr Asp 


Pro 


1 


5 






10 








15 


lie His Ala 


Gly Arg 


Arg 


Trp Ser Ser 


Ser 


Ser Pro 


Gly Arg 


Pro 




20 






25 






Thr 


30 


Cys Ala Arg 


Ser Ser 


Trp 


Arg cys Ser 


Cys 


Sfer His 


Thr 


Thr 




35 




40 








45 


Ala Gly Arg 


Arg Arg 


Arg 


Trp Trp Arg 


Arg 


Pro Gly Cys 


Ala 


Trp 




50 






55 








60 


Ala Arg Ala 


Ser Thr 


Thr 


Arg Ser Pro 


Ala 


Ser Pro 


Thr 


Ala 


Ser 


65 




70 








75 


Ala Ala Thr 


Ser Ala 


Ser Arg Arg Thr Ala Gly Gly Pro 


Pro 


Ala 




80 






85 








90 


Thr Ala Thr 


Ser Ala 


Thr 


Ala Gly Ala Arg 


Arg Arg Ala Lys 


Gin 




95 






100 








105 


Ser Ser Ser 


Asn Thr 


Leu Gly Leu Pro 


Glu 


Leu Asn 


Ser 


Ser 


Ser 




110 






115 








120 


Thr Lys 


















<210> 414 


















<211> 86 


















<212> PRT 


















<213> Homo sapiens 
















<220> 


















<221> misc_: 


feature 
















<223> Incyte ID No: 


LI : 1072759 .l.orf 2 :2000MAY01 








<400> 414 


















Arg Pro Glu Glu Asp 


Met 


Arg Gin Leu 


Val 


His Gly 


Ser 


Gin 


Arg 


1 


5 




10 








15 


Gly Thr Leu Arg Lys 


Met 


Gly Leu Gin 


Pro 


Arg His 


Ser 


Ser 


Leu 




20 




25 








30 


Trp Cys Gin 


Phe Leu 


Val 


Gly Met Val 


Thr 


Thr Phe 


Trp 


Lys 


Gin 


35 




40 








45 


Gly Ala He 


He Ala 


Leu 


Val Ser Arg 


Arg 


Trp Lys 


Val 


Thr 


Arg 




50 






55 








60 


Lys Gly Trp 


Gin Cys 


Gin 


Val Arg Thr 


Thr 


Leu Ala 


Cys 


Arg 


Leu 




65 




70 








75 


Leu Asp Cys 


He Leu 


Pro 


Pro Asn Ser 


Tyr 


Asn 










80 






85 
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<210> 415 
<211> 213 
<212> PRT 

<213> Homo sapiens , 
<220> 

<221> misc feature 

<223> Incyie ID No: LG:998857 . 1 .orf3 :2000FEB18 
<400> 415 



Arg 


Gin 


Trp 


Gin Leu 


Leu 


Cys 


Cys Pro Cys Cys Cys Gly Gly 


Cys 


1 






5 






10 


15 


Trp 


Ala 


Arg 


Val Glu 


Val 


Gin 


Gly Trp Gly His Gly' Leu Ala 


Pro 








20 






25 


30 


Gly 


Leu 


Gly 


Trp Ser 


Cys 


Pro 


Ala Gly Pro Gly Gly His Arg 


Gly 








35 






• 40 


45 


Cys 


Gly 


Cys 


Pro Gly 


Val 


Gly 


Arg Thr Trp Lys Ala Ala Arg 


Ser 








50 






55 


60 


Arg 


Gly 


Ser 


Trp Val 


Ser 


Trp 


Val Ser Trp Gly Thr Trp Arg 


Ser 








65 






70 


75 


Trp 


Arg 


Ser 


Trp Gly 


Ser 


Trp 


Gly Ser Trp* Asp Ser Arg Ala 


Pro 








80 










Pro 


Pro 


Glu 


Pro Ser. 


Arg 


Thr 


Pro* Arg Gly Thr Asn Arg Ser 


Arg 








95 






100 


105 


Thr 


Ser 


Pro 


Pro Ala 


Cys 


Arg 


Arg Ala Ser Arg Ser Gly Pro 


Arg 








110 






115 


120 


Ala 


Pro 


Thr 


Pro Pro 


Pro 


Arg 


Pro Gin Arg Arg Ser Ala Ala 


Ala 








125 






130 


135 


Cys 


Gly 


Trp 


Arg Pro 


Arg 


Ser 


Gly Arg Pro Gly Arg Gly Leu 


Ser 








140 






145 


150 


Trp 


Arg 


Pro 


Arg Arg 


Arg 


Pro 


Gly Ala Arg Gin Arg Ser Trp 


Ser 








155 






160 


165 


Ala 


Ser 


Phe 


Gly Arg 


Cys 


Arg 


Cys Arg Ala Arg Arg Pro Thr 


Ala 








170 






175 


180 


Cys 


Gly 


Ala 


Gly Ser 


Asp 


Gly 


Leu Ala Leu Ala Gly Thr Arg 


Ala 








185 






190 


195 


Ser 


Arg 


Arg 


Ala Arg 


Cys 


Arg 


Arg Ser Arg Ser Arg Ala Gly 


Cys 








i 200 






205 


210 


Gly 


Gly 


Ser 













<210> 416 
<211> 263 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: LG:482261 . 1 .orf 1 :2000FEB18 
<400> 416 



Leu 


Leu Asn 


Phe 


Val 


Ser 


Leu Ser Leu 


Phe 


Leu Phe Ser 


Phe 


Pro 


1 






5 






10 






15 


Arg 


Leu His 


Arg Glu 


Gly 


Glu Ser Leu 


Arg 


Val Val Tyr Gin 


Ala 








20 






25 






30 


Gly 


Ser Pro 


Thr 


Ser 


Leu 


Ala Pro Asn 


Thr 


Val Ser Ser 


Asn 


Pro 








35 






40 






45 


Gly 


Glu Val 


Thr 


Pro 


Glu 


Arg Gly Arg 


Cys 


Glu Glu Arg 


Ser 


Val 








' 50 






55 






60 


Gin 


Glu Leu 


Pro 


Arg 


Thr 


Cys Gly Arg 


Pro 


Cys Gly Lys 


Leu 


Val 








65 






70 






75 


His 


Ser Glu 


His 


Ser 


Arg 


Asp Thr Met 


Gly 


Gin Ser Lys 


Ser 


Lys 








80 






85 






90 


His 


Ser Ala 


Tyr 


Leu 


His 


Phe He Lys 


Leu 


Leu Leu Lys 


Arg 


Ala 








95 




100 






105 


Gly 


lie Lys 


Ala 


Ser 


Thr 


Glu Asn Leu 


He 


Thr Leu Phe 


Pro 


Thr 








110 






115 






120 
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Val 


Glu 


Gin 


Tyr Cys 
125, 


Pro Trp Phe 


Pro 


GlU 
130 


His 


Gly Thr Met Asp 
135 


Phe 


Lys 


Asp 


Trp Glu 


Gin Val Gly lie Ala 


Leu 


Lys Gin Val Cys 




140 






145 




150 


Lys 


Glu 


Gly 


Lys Phe 


He Pro Leu 


Thr 


Ala 


Trp 


Ser Asn Trp Ala 




155 






160 




165 


He 


Val 


Lys 


Ala Ala 


Ser Glu Pro 


Phe Gin 


Ser 


Glu Asn Glu Ala 






170 






175 




180 


Tyr 


Pro 


Pro 


Ala Glu 


Arg He Ser 


Ala 


Glu 


Glu 


Gly Gly Asp Ala 






185 




190 




195 


Ala 


Glu 


Gly 


Gly Glu 


Asp Ser Glu 


Glu 


Asp 


Phe 


GjLuGlu Asn Thr 






200 






205 




210 


Asp 


Lys 


Pro 


Gly Asp 


Glu Leu He 


Ser 


Phe 


Glu 


Glu His Val Gly 




215 






220 




225 


Pro 


Ser 


Ala 


Ala Pro 


Lys He Glu Lys 


Pro 


Tyr 


Met Pro Arg Cys 








230 






235 




240 


Leu 


Lys 


Gin 


Arg Arg 


Ala Leu Arg 


Ser 


Ser 


Arg 


Leu Leu He Gly 






245 






250 




255 


He 


He 


Arg 


Ser Gly 
260 


Arg Leu Gin 











<210> 417 

<211> 175 ' 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LG : 480328 . 1 . orfl : 200 0PEB18 
<400> 417 



Val 


Arg 


cys 


Gly 


Gly 


Gin Ser Gly 


Ser 


Arg 


Cys 


Glu Arg 


Val 


Pro 


1 








5 








10 










15 


Gin 


Lys 


Lys 


Thr 


Trp 


Leu Pro 


Lys 


Cys 


Leu 


Ser 


Pro 


Ser 


Ala 


Ser 






20 








25 










30 


Leu 


Gly 


Leu 


Ala 


Leu 


Ala Val 


Ala Gly 


Gly 


Met 


Val 


Asn 


Ser 


Ala 








35 








40 










45 


Leu 


Cys 


Asn 


Val 


Asp 


Ala Gly His Arg 


Ala 


Ala 


He 


Phe 


Asp 


Gin 








50 








55 










60 


Phe 


Arg 


Gly Val 


Gin 


Asn He 


Val 


Val 


Gly Glu 


Gly Thr 


His 


Phe 








65 








70 










75 


Leu 


He 


Pro 


Cys 


Val 


Gin Lys 


Pro 


He 


He 


Phe 


Asp 


Cys 


Cys 


Ser 








80 








85 










90 


Gin 


Pro 


Arg 


Ser 


Ala 


Pro Val 


He 


Thr 


Gly 


Ser 


Lys 


Asp 


Leu 


Gin 








95 








100 










105 


Asn 


Val 


Asn 


He 


Thr 


Leu Cys 


He 


Leu 


Phe 


Arg 


Pro 


He 


Thr 


Ser 










110 








115 










120 


Gin 


Leu 


Pro 


Arg 


He 


Phe Thr 


Ser 


He 


Gly Glu 


Asp 


Tyr 


Asp 


Glu 








125 








130 










13 5 


Cys 


Val 


Leu 


Pro 


Phe 


He Thr 


Thr 


Glu 


He 


Leu 


Lys 


Ser 


Leu 


Val 








140 








145 










150 


Ala 


Arg 


Phe 


Asp 


Ala 


Gly Glu 


Leu 


He 


Thr 


Gin 


Arg 


GlU 


Leu 


Val 






155 






160 










165 


Ser 


Ser 


Gin 


Val 


Ser 


Asn Asn 


Leu 


Met 


Glu 













170 175 

<210> 418 
<211> 272 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: LG:311197 *l.orf2 :2000MAY19 
<400> 418 

Gin Val Pro Phe Pro Ser His He Pro He Tyr Val Cys Leu Cys 
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BNSOOC1D: <WO 0162927A2_I_> 



WO 01/62927 gjfa 
1 ^5 



Val 


Gin 


Gin 


Thr Pro 


Ala 


Phe 


Ala 


Thr 


Phe 


Thr 


Phe 


20 

Ala Ser 


Trp 


Glu 


Leu 


Val 








35 








Asn 


Glu 


Glu 


Gin Gin 


Lys 


Asp 


Val 


Thr 








50 










Leu 


Hi's 


Val 


Gly Gly 


Val 


Met 


Leu 


Lys 








65 










He 


Ser 


Gin 


Asp Trp 


Ser 


Asp 


Phe 


Ala 








80 










His 


Cys 


Trp 


Leu Leu 


Lys 


Thr 


His 


Tip 








95 










Val 


Gin 


Ala 


Asp Ala 


Lys 


Leu 


Leu 


Phe 








110 








Leu 


Arg 


Leu 


Arg Leu 


Pro 


Asn 


Leu 


Lys 








125 










Ser 


Phe 


Ser 


Ala Val 


Val 


Phe 


Lys 


Ala 








140 








He 


Leu 


Asn 


He Arg 


Arg 


Ser 


Glu 


Glu 








155 








Ser 


Gly 


Asp 


Tyr Phe 


Lys 


Lys 


Lys 


Lys 








170 










Lys 


Glu 


Pro 


He He 


Glu 


Asp 


lie 


Leu 








185 








Thr 


Ala 


Ser 


Gly Ser 


Ser 


Val 


Ser 


Pro 








200 










Met 


Thr 


Pro 


He Tyr 


Asp 


Pro 


He 


Asn' 








215 








Thr 


Met 


Thr 


Trp Phe 


Ser 


Asp 


Ser 


Pro 








23 0 








Ser 


He 


Leu 


Ala Phe 


Ser 


Gin 


Pro 


Pro 








245 










Ala 


Asp 


Met 


Tyr Gin 


Pro 


Arg 


Ser 


Leu 



260 
Asn Ala 



<210> 419 
<211> 167 
<212> PRT 
<213> Homo sapiens 



PCT/US01/06059 



10 




15 


Met 


Leu 


Ser Ser Thr Asp 


25 




30 


Val 


Arg 


Val Asp His Pro 1 


40 




45 


Leu 


Arg 


Val Ser Gly Asp 


55 




60 


Leu 


Val 


Glu Gin He Asn 


70 




75 


Leu 


Trp 


Trp Glu Gin Lys 


85 




90 


Thr 


Leu 


Asp Lys Tyr Gly 


100 




105 


Thr 


Pro 


Gin His Lys Met 


115 




120 


Met 


Val 


Arg Leu Arg Val 


130 




135 


Val 


Ser 


Asp lie Cys Lys 


145 




150 


Leu 


Ser 


Leu Leu Lys Pro 


160 




165 


Lys 


Lys 


Asp Lys Asn Asn 


175 




180 


Asn 


Leu 


Glu Ser Ser Pro 


190 




195 


Gly 


Leu 


Tyr Ser Lys Thr 


205 




210 


Gly 


Thr 


Pro Ala Ser Ser 


220 




225 


Leu 


Thr 


Glu Gin Asn Cys 


235 




, 240 


Gin 


Ser 


Pro Glu Ala Leu 


250 




255 


Val 


Asp 


Thr Ala Lys Leu 


265 




270 



<220> 

<221> misc_feature 

<223> Incyte ID No: LG: 1054883 . 1 . orfl : 2000MAY19 
<220> 

<221> unsure 1 
<222> 2 

<223> unknown or other 



<400> 419 






Arg 


Xaa 


Arg Gin Pro 


Leu 


Leu 


1 




5 






Ser 


Thr 


His Gly Arg 
20 


Ser 


Met 


Phe 


Leu 


Lys Phe Trp 
35 


Thr 


Tyr 


Thr 


Gin 


Asn Gly Asp 
50 


Cys 


Ser 


Val 


Gly 


Gin Gin He 


Lys 


Thr 






65 




Glu 


Cys 


Met Glu Thr 


Leu 


Lys 






80 




Gin 


Tyr 


Lys Val Cys 
95 


Ser 


Pro 



Gin 


Ser 


His 


Pro 


Glu Ala Asp Trp 






10 




15 


Arg 


Lys 


Leu 


He 


Val Arg Phe He 




25 




30 


Thr 


Val 


Arg 


Ala 


Ser Thr Asn Leu 






40 




45 


Gin 


Cys 


He 


Tyr 


Gin Val Thr Glu 




55 




60 


He 


Phe 


Leu 


Phe 


Tyr Ser Tyr Tyr 






70 




75 


Glu 


Thr 


Cys 


Leu 


Tyr Asn Ala Thr 






85 




90 


Arg 


Asn 


Asp 


Arg 


Pro Asp Ala Cys 






100 




105 
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^Wkla Ala Thr Thr Val Phe Glu Il^Rrc 



Tyr Asn Pro Ser Glu Pfo^Ala Ala Thr Thr Val Phe Glu IlWfrg 

110 115 120 

Thr Gly Leu Leu Leu Gly Asp Thr Ser Lys lie lie Thr Arg Thr ' 

125 , 130 135 

Glu Glu Lys Glu lie Pro Lys Gin lie Thr Leu Arg Phe Asp Ala 

140 145 150 

Cys Ala Ala lie Asn Ser Lys Lys Leu Glu lie Gly Cys Gly Ser 

155 160 165 

Leu Asn 



<210> 420 

<211> 59 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: LG : 399395 . 1 . orf 2 : 2000MAY19 
<400> 420 



Ser Gin 


His 


Phe 


Gly 


Arg Pro Arg 


Gin 


Glu 


Asp 


His Leu 


Ser 


Pro 


1 






5 




10 








15 


Gly Val 


Gin 


Asp 


Gin 


Pro Gly Gin 


His 


Ser 


GlU 


Thr Leu 


Thr 


Gin 




20 




25 








30 


Lys lie 


Lys 


Arg 


Lys 


Asp Lys Asn 


Thr 


Arg 


Met 


Ala Lys 


Gin 


Thr 








35 






40 








45 


Ser Val 


His 


Gin 


Pro 


Gly Gly He 


Leu 


Tyr 


Ser 


Leu Leu 


Lys 










50 




55 











<210> 421 
<211> 216 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: LG : 380497 . 2 . orf 1 : 2000MAY19 



<400> 421 




















Ser 


Pro 


Arg Pro 


Leu 


Gin 


Ser 


Ala Gly 


Glu 


Gly 


Val Thr His 


Val 


1 




5 










10 






15 


Leu 


He 


Leu Leu 


GlU 
20 


Ser 


Pro 


Ala 


Arg 


Pro 
25 


Val 


Ala Ala Val 


Thr 
30 


Gin 


Val 


Gin Arg 


Arg 
35 


Arg 


Tyr 


His 


Arg 


Leu 
40 


Ser 


Asp Met Ser 


Met 
45 


Leu 


Ala 


Glu Arg 


Arg 


Arg 


Lys 


Gin 


Lys 


Trp 


Ala 


Val Asp Pro 


Gin 








50 










55 




60 


Asn 


Thr 


Ala Trp 


Ser 


Asn 


Asp 


Asp 


Ser 


Lys 


Phe 


Gly Gin Arg 


Met 






65 






70 






75 


Leu 


Glu 


Lys Met 


Gly 


Trp 


Ser 


Lys 


Gly 


Lys 


Gly 


Leu Gly Ala Gin 








80 










85 






90 


Glu 


Gin 


Gly Ala 


Thr 


Asp 


His 


He 


Lys 


Val 


Gin 


Val Lys Asn 


Asn 






95 








100 






105 


His 


Leu 


Gly Leu 


Gly 


Ala 


Thr 


He 


Asn 


Asn 


Glu 


Asp Asn Trp 


He 






110 










115 






120 


Ala 


His 


Gin Asp 


Asp 


Phe 


Asn 


Gin 


Leu 


Leu 


Ala 


Glu Leu Asn 


Thr 






125 










130 






135 


Cys 


His 


Gly Gin 


Glu 


Thr 


Thr 


Asp 


Ser 


Ser 


Asp 


Lys Lys Glu 


Lys 






140 








145 






150 


Lys 


Ser 


Phe Ser 


Leu 


Glu 


GlU 


Lys 


Ser 


Lys 


He 


Ser Lys Asn 


Arg 






155 








160 






165 


Val 


His 


Tyr Met 


Lys 


Phe 


Thr 


Lys 


Gly 


Arg 


cys 


Gin Ser Leu 


His 








170 








175 






180 


Ser 


Arg 


Gly Glu 


Arg 


Asn 


His 


Asp 


Asn 


Gin 


Arg 


Leu His His 


Pro 








185 








190 






195 


Gly 


Val 


Leu Cys 


Cln 


Ala 


Asp 


Gly 


Ser 


Thr 


Glu 


Glu Gin Ala 


Pro 
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)0 205 210 

Gly Ser Ser Ser Arg Val 
215 

<210> 422 
<211> 162 
<212> PRT 

<213> Homo' sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: LI:272913 .22 .orfl :2000MAY01 
<400> 422 

Ser Ala Thr Ala Ala Ala Pro Arg Glu Arg Lys Met Ala Pro His 
1 5 10 15 

Gly Pro Gly Ser Leu Thr Thr Leu Val Pro Trp Ala Ala Ala Leu 
20 25 30 

Leu Leu Ala Leu Gly Val Glu Arg Ala Leu Ala Leu Pro Glu lie 
35 40 45 

Cys Thr Gin Cys Pro Gly Ser Val Gin Asn Leu Ser Lys Val Ala 
50 55 60 

Phe Tyr Cys Lys Thr Thr Arg Glu * Leu Met Leu His Ala Arg Cys 
65 70 75 

Cys Leu Asn Gin Lys Asp Pro Gly Pro Asn Phe His Gin Ala His 
80 85 90 

Thr Thr Val lie lie Asp Leu Gin Ala Asn Pro Leu Lys Gly Asp 
95 100 105 

Leu Ala Asn Thr Phe Arg Gly Phe Thr Gin Leu Gin Thr Leu lie 
110 115 120 

Leu Pro Gin His Val Asn Cys Ser Gly Gly lie Asn Ala Trp Asn 
125 130 135 

Thr lie Thr Ser Tyr lie Asp Asn Gin lie Cys Gin Gly Gin Lys 
140 145 150 

Asn Leu Cys Asn Asn Thr Gly Asp Pro Glu Met Cys 
155 160 



> 
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